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Single Chain Analogs f the TGF-B Soperfamilv fM rohons^ 

Background of the Invention 

This invention relates to the design and production of sing}e-diain biosynthetic constructs 
herein caUed "morphons" ^ch mimic the activities of one or more members of the TGF-p 
superfiunily. 

The TCT-p superfamily includes five distinct forms of TGF-P (Spom and Roberts (1990) in 
S Peptide Gfowtfi Factors and Thdr Receptors. Spom and Robots, eds., Springer-Verlag: Berlin 
pp. 419^72), as weU as the differentiation fiurtors vg-1 (Wedcs and Mdton (1987) £^51: 861- 
867),DPP-C polypeptide (Padgett et al. (1987) Nature 325: 81-84), the hormones activinand 
inhibm (Mason et al. (1985) Nature 318: 659-663; Mason et al. (1987) Growth Factors 1: 77-88), 
the Mullerian-inhibiting substance, MIS (Gate et al. (1986) CeU 45:685-698), osteogenic and 

10 morphograic proteins OP-1 (PCT/IIS90/05903), OP-2 (PCT/US9 1/07654), OP-3 

(PCTAVO94/10202), the BMPs, (see U.S. Patent Nos. 4,877,864; 5,141,905; 5,013,649; 
5,116,738; 5,108,922; 5,106,748; and 5,155,058), the devdopmentally regulated protein VGR- 
1 (Lyons et al. (1989) Proc, £jatL Acad. Sd. USA 86: 4554-4558) and the growth/differentiation 
factors GDF-1, GDF-S, GDF-9 and dorsalin-1 (McPherron gt al. (1993) I Bid. Chem . 268 : 

1 5 3444-4449; Basler et al. (1 993) Cdl 73 : 687-702). 

The proteins of the TGF-3 superfamily are disulfide-linked homo- or heterodimers that are 
expressed as large precursor polypeptide chains containing a hydrophobic signal sequence, a long 
and relatively poorly conserved N-terminal pro region of several hundred amino acids, a cleavage 
site, and a mature domain comprising an N-terminal region which varies among the family 

20 members and a more highly conserved C-terminal region. This C-terminal region, present in the 
processed mature proteins of all known family members, contains approximately 100 amino acids 
with a characteristic cysteine motif having a conserved sbc or seven cysteine skeleton. Although 
the position of the cleavage site between the mature and pro regions varies among the family 
members, the cysteine pattern of the C-terminus of all of the proteins is in the identical format, 

25 ending in the sequence Cys-X-Cys-X (Spom and Roberts (1 990), supra V 

Recombinant TGF-pl has been cloned (Derynck et ^. (1985) Nature 316 : 701-705), and 
expressed in Chinese hamster ovar>' cells (Gentry et al. (1987) Mol. Cell. Biol. 7: 3418-3427). 
Additionally, recombinant human TGF-p2 (deMartin et al. (1987) EMBO I 6: 3673), as well as 
human and porcine TGF-P3 (Deiynck et al. (1988) EMBO J 7: 3737-3743; Dijkeetal. (1988) 
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Proc. Natl Acftd, So. USA 85: 4715), have been cloned. Expression levels of the mature TGF- 
pl protein in COS cells have been mcreased by substituting cysteine residues located in the pro 
region oftheTGF-pl precursor with serine residues (Brunncr fit rf. (1989) I BioL Chem. 264 : 
13660-13664). 

A unifying feature of the biology of the proteins of the TGF-P superfamily is their ability to 
r^;ulate developmental processes. These structurally related proteins have been identified as 
being involved in a variety of developmental events. For example, TGF-3 and the polypq)tides of 
the inhibin/activin group appear to play a role in the reguhdon of cell growth and differaitiation. 
MIS causes regression of the Mullerian duct in development of the mflmmal ifln male embryo, and 
dpp, the gene product of the Drosophila decapentaplegic complex, is required for appropriate 
dorsal-ventral specification. Similarly, Vg-1 is involved in mesoderm induction in Xraopus, and 
Vgr-1 has been identified in a variety of developing murine tissues. Regarding bone formation, 
many of the proteins in the TGF-3 supergene family, namely OP-1 and a subset of the BMPs 
apparently play the major role. OP-1 (BMP-7) and other osteogenic proteins have been produced 
using recombinant techniques (U.S. Patent No. 5,01 1,691 and PCT Application No. US 
90/05903) and shown to be able to induce formation of true endochondral bone in wo. BMP-2 
has been recombinantly produced in monkey COS-1 cells and Chinese hamster ovary cdls (Wang 
etal. (1990)P!BcJM.AcaA Sd.lJSA87:2M^^ 

Recently the family of proteins taught as having osteogenic activity as judged by the 
Sampath and Reddi bone formation assay have been shown to be morphogenic, i.e., capable of 
indudng the developmental cascade of tissue morphogenesis in a mature mammal (See PCT 
Application No. US 92/01968). In particular, these proteins are capable of inducing the 
proliferation of uncommitted progenitor cells, and inducing the difiR^entiation of these stimulated 
progenitor cells in a tissue-specific manner under appropriate environmental conditions. In 
addition, the raorphogens are capable of supporting the growth and maintenance of these 
differentiated cells. These morphogenic activities allow the proteins to initiate and maintain the 
developmental cascade of tissue morphogenesis in an appropriate, morphogenically permissive 
environment, stimulating stem cells to proliferate and differentiate in a tissue-specific manner, and 
inducing the progression of events that culminate in new tissue formation. These morphogenic 
activities also allow the proteins to induce the "redifferentiation" of cells previously stimulated to 
stray fi-om their differentiation path. Under appropriate envirormiental conditions it is anticipated 
that these morphogens also may stimulate the "redifferentiation" of committed cells. 

The tertiary and quaternary structure of both TGF-P2 and OP-1 have been determined. 
Although TGF-P2 and OP-1 exhibit only about 35% amino acid identity in their respective amino 
acid sequences the tertiary and quaternary structures of both molecules are strikingly similar. 
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Both TGF-P2 and OP- 1 are dimeric in nature and have a unique folding pattern involving six of 
the seven C-terminal cysteine readues, as ilhistrated in Figure lA. Figure 1 A shows that in each 
subunit four cysteines bond to generate an eight readue ring, and two additional two cysteine 
residues form a disulfide bond that passes through the ring to form a knot*4ike structure. With a 
5 numboing sdieme beginning with the most N-tenninal cysteine of the 7 conserved cysteine 
residues assigned number 1, the 2nd and 6th cysteine residues bond to close one ade of the eight 
residue ring while the 3rd and 7th cysteine residues close the other side. The 1 st and 5th 
conserved cysteine residues bond through the center of the ring to form the core of the knot. The 
4th cysteine forms an interchain disulfide bond with the corresponding residue in the other 
10 subunit. 

The TGF*32 and OP-1 monomer subunits comprise three major structural elements and an 
N-terminal r^on. The structural elements are made up of regions of contiguous polypeptide 
chain that possess over 50% secondary structure of the following types: (1) loop, (2) helix and 
(3) P-sheet. Furthermore, in these regions the N-terminal and C-terminal strands are not more 

15 than 7 A apart. The residues between the 1st and 2nd conserved cysteines (Fig. 1 A) form a 

structural region characterized by an anti-parallel P-sheet finger, referred to berdn as the finger 1 
npgion (Fl). A ribbon trace of the finger 1 pq>tide backbone is shown in Fig. IB. Similariy the 
residues between the 5th and 6th conserved cysteines in Fig. 1 A also form an anti-parallel P-sheet 
finger, referred to herein as the finger 2 r^on (F2). A ribbon trace of the finger 2 peptide 

20 backbone is shown in Fig. ID. A p*sheet finger is a single amino acid chain, comprising a p- 
strand that folds back on itself by means of a P-tum or some hirger loop so that the entering and 
exiting strands form one or more anti-parallel p-sheet structures. The third major structural 
region, involving the residues between the 3rd and 4th conserved cysteines in Fig. 1 A, is 
characterized by a three turn a-helix referred to herein as the heel region (H). A ribbon trace of 

25 the heel peptide backbone is shown in Fig. 1 C. 

The organization of the monomer structure is similar to that of a left hand where the knot 
region is located at the position equivalent to the palm, finger I is equivalent to the index and 
middle fingers, the a-helix is equivalent to the heel of the hand, and finger 2 is equivalent to the 
ring and small fingers. The N-terminal region (not well defined in the published structures) is 
30 predicted to be located at a position roughly equivalent to the thumb. 

In the dimeric forms of both TGF-P2 and OP-1 , the subunits are oriented such that the heel 
region of one subunit contacts the finger regions of the other subunit with the knot regions of the 
connected subunits forming the core of the molecule. The 4th cysteine forms a disulfide bridge 
with its counterpart on the second chain thereby equivalently linking the chains at the center of the 
35 palms. The dimer thus formed is an ellipsoidal (cigar shaped) molecule when viewed fi-om the top 
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iooking down the two-fold axis of symmetry between the subunhs (Fig. 2A). Viewed from the 
side, the molecule resembles a bent "ctgar^ since the two subunits are oriented at a slight angle 
relative to eadi other (Fig. 2B). 

U.S. Patent Nos.: 5,132,405; 5,091,513; and 5,258,498 and PCX Application No. 
5 PCTAJS88/01 737 disclose how to make single-cfaain bindiAg proteins whidi mimic the structure 
of an immunoglobulin Fv region by linking the C- and N-tennini of lig^t- and heavy chain variable 
r^on domains, and bow to make multifimctionat protdns by linking together separate protein 
domains which fiinction either indepradendy or in concert. U.S. Patent Nos,: 4,704,692; 
4,881,175; 4,939,666; 4,946,778; and 5,260,203 disclose computer-based techniques for 
10 selecting a peptide linker sequences for connecting separate protdn chains to produce a single- 
chain protein. 
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Summary fthclnvcnti n 

The invention provides a family of single-chain constructs of the TGF-p superfiunily 
(hereinafter called "morphons") which mimic the physiological eflFects of one or more members of 
the superfemity. Specifically, the morphon constructs of the invention bind preferentially to a 
5 natural cdl surfece receptor that typically interacts with a TGF-p superfamily member, and the 
morphon, upon binding with the receptor initates a cascade of events that would occur when the 
TGF-P superfamily m«iiber binds to the receptor. 

The morphon constructs differ fi-om the natural TGF-p superfamily members in that they are 
single-chain proteins which preferably are e>q)ressed from a single DNA in a host cell. The 

1 0 natural members of the TGF-p superfemily are dimeric structures wherein the monomer subunits 
are held together by non-covalent interactions or by one or more disulfide bonds. The TGF-p 
superfamily members are inactive as monomers. In contrast, the morphon constructs comprise a 
functional monomer subunit and, therefore, are believed to be more stable than the natural dimers, 
particularly under redudng conditions. In addition, the morphon constructs may have a molecular 

1 S weight significantly lower than the natural dimers and thus are likely to diffuse fester and be 
cleared by the body &ster than natural superfamily menibers. 

The morphon constructs preferably are manufactured in accordance with the principles 
disclosed herein by assembly of nucleotides and/or joining DNA restriction fragments to produce 
synthetic DNAs. The DNAs are transfected into an appropriate protein expression vehicle, the 

20 encoded protein expressed, folded if necessary, and purified. Particular constructs can be tested 
for agonist activity in vitro . The tertiary structure of the candidate morphon constructs may be 
iterativdy refined and binding modulated by site-directed or nucleotide sequence directed 
mutagenesis aided by the principles disclosed herdn, computer-based protein structure modeling, 
and recentiy developed rational drug design techniques to improve or modulate specific properties 

25 of a molecule of interest. Known phage display or other nucleotide expression systems may be 
exploited to produce simultaneously a large number of candidate constructs. The pool of 
candidate constructs subsequently may be screened for binding specificity using, for example, a 
chromatography column comprising surface immobilized receptors, salt gradient elution to select 
for, and to concentrate high binding candidates, and in vitro assays to determine whether or not 

30 particular isolated candidates agonize the activity of the template superfamily member. 

Identification of a useful construct is followed by production of cell lines expressing commercially 
useful quantities of the construct for laboratory use and ultimately for producing therapeutically 
useful drugs. It is contemplated also that preferred single-chain constructs, once identified and 
characterized by the recombinant DNA methodologies described above, may be produced by 

35 standard peptide synthesis methodologies. 
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It has now been discovered bow to design, make, test and use single-chain amino add 
constructs comprising an amino add sequence which, wh» properly folded, assume a tertiary 
structure defining a finger 1 r^on, a finger 2 r^on, and a hed region vMch together are 
conq>lementary to the ligand binding interactive sur&ce of a TGF*p superfitmily member receptor. 
The constructs, upon binding with the recqitor agonize the activity of a TGF-^ superfiunily 
member. In one important subset of onbodiments, the constructs agonize initiation of cellular 
diSerentiation and tissue morphogenesis, e.g., initiate cell transformation leading to new tissue 
formation, such as, bone formation. The constructs comprise an amino acid sequence su£Sdently 
duplicative of the amino arid sequence of a TGF-P superfamily member such that it preferentially 
binds the cognate receptor for that member. 

All of the morphon constructs of the invention conq>rise regions of amino add sequences 
defining the three r^ons required for utility, namdy, finger 1, finger 2, and hed regions, and 
additional linker sequences which join these regions, maintain them in their proper conformation 
individually, and maintain their rdative positions and orientations in space. Sequences for the 
finger and heel regions may be copied fi-om the respective finger and heel region sequences of any 
known TGF-P superfamily member identified herein. Alternatively, the finger and heel regions 
may be selected firom the amino acid sequence of a new member of this superfamily discovered 
hereafter using the principles disdosed herdnbdow. 

The finger, heel, and linker sequences also may be altered by amino acid substitution, for 
example by exploiting substitute amino arid residues selected in accordance with the prinriples 
disclosed in Smith et al. (1990) Proc. Nat}. Acad . Sd. USA 87: 1 18-122, the disdosure of which 
is incorporated herein by reference. Smith et al. disclose an amino acid class hierarchy, similar to 
the amino acid hierarchy table set forth in Figure 10, which may be used to rationally substitute 
one amino acid for another while minimizing gross conformational distortions of the type which 
othenvise may inactivate the protdn. In any event, it is contemplated that many synthetic finger 
1, finger 2, and heel region sequences, having only 70% homology with natural regions, 
preferably 80%, and most preferably at least 90%, may be used to produce active morphon 
constructs. It is contemplated also, as disclosed herein, that the size of the constructs may be 
reduced significantly by truncating the natural finger and heel regions of the template TGF-P 
superfamily member, while compensating for dimensional changes using linkers as disclosed 
below. 

The linker sequences, as described herein, join and maintain the spatial relationship of the 
finger and heel regions within a monomeric subunit. The linker sequences typically comprising 
about 3-13 amino acids serve to maintain, for example, the cysteine structural motif, namely, the 
cysteine pattern and knot structure (cysteines 1, 2, 3, 5, 6. and 7, and their respective spatial and 
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bonding rdationship) which characterize, and are believed to be essential for maintainifig the 
tertiary structure of the members of the superfamily. Principles and methods for sdectii^ 
appropriate poIypq)tide linko* sequences are disclosed hereinbdow. 

More specifically, the invration provides a functional morphon construct that is constituted 
5 by a finger 1, a finger 2, and a heel region, joined together by peptide linkers having, e.g., 3-13 
amino adds, and optionally including an N-tenninal sequence upstream of the first cysteine 
b^inning the sequence of the finger 1 region. In the smgle-chain morphon constructs of the 
invention, the finger 1, finger 2 and heel regions together define a structure complementary to the 
ligand binding surface of a TGF-P supofiunily member receptor structure and are suffidently 
10 duplicative of a sequence of a member of the TGF-P superfamily sudi that the construct 
preferentially binds the recqitor. Where the finger 1 re^on is deagnated Fl , the fmger 2 
designated F2, and the heel designated H, the constructs of this type can take the form of one of 
the constructs set forth bdow, and indude: 

FMinker-F2-Unker-H; 
15 Fl-linker-H-linker-F2: 
F2-linker.FMinker.Hi 
F2-linker-H-linker.Fl 
H-linker-Fl-linker-F2; and 
H-linker-F2-linker.Fl. 

20 Generic and specific sequences of amino acids which constitute the finger I, finger 2, heel 

regions, and N-terminal sequences are disclosed hereinbelow. 

The invention further comprises DNAs encoding the morphon constructs of the invention, 
cell lines transformed with the DNAs, and methods of making the morphons by culturing 
transformed cells to produce them followed by purification. 



25 



The invention may be understood further, and various of its objects and features better 
appreciated by referring to the drawings, description, sequence listing, and claims which follow. 
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Brief Dcscripti n of the Drawings 

Figure 1 A is a simplified line drawing usefiil in describing the structure of a monomeric 
subunit of a TGF-p superfamily member. See the Background of the Invention, supra> for 
e^qplanatioa Figures IB, IC, and ID are monovision ribbon tracings of the respective peptide 
5 badd>ones of typical secondary structures of the finger 1, heel, and finger 2 regions. 

Figures 2 A and 2B are stereo peptide backbone ribbon trace drawings ilhistrating the generic 
three-dimensional shape of TGF-P superfamily monber protein dimer: A) fi^om the "top" (down 
the two-fold axis of symmetry between the subunits) with the axes of the helical hed r^ons 
generally normal to the paper and the axes of each of the finger 1 and finger 2 regions generally 
10 vertical, and B) fi-om the "side" with the two-fold axis between the subunits in the plane of the 
paper, with the axes of the heels generally horizontal, and the axes of the fingers generally 
vertical. The reader is encouraged to view the stereo alpha carbon trace drawings in wall eyed 
stereo to understand better the q^atial relation^ps in the morphon design. 

Figures 3 A and 3B are respective schematic representations of a natural mraiber of the TGF- 
IS p super&mily and a morphon construct of the invention. 

Figures 4A and 4B are stereo peptide backbone ribbon trace drawings of a half molecule or 
morphon construct viewed fi'om the top (same convention as used in Figure 2) and side, 
respectively. 

Figures 5A, 5B, and 5C are single letter code hstings of amino acid sequences, arranged to 
20 indicate aligrmients and homologies of the finger 1, heel, and finger 2 regions, respectively, of the 
currently known members of the TGF-p superfamily. Shown are the respective amino adds 
compri^g each region of human TGF-Pl through TGF-PS (the TGF-p subgroup), the Vg/dpp 
subgroup consisting of dpp, Vg-1. Vgr-1, 60A (see copending U.S.S.N. 08/271,556). BMP-2A 
(also known in the literature as BMP-2), dorsalin, BMP-2B (also known in the literature as BMP- 
25 4), BMP-3, BMP-5, BMP-6, OP-1 (also known in the literature as BMP-7), OP-2 (see 

PCT/US91/07635 and U.S. Patent No. 5,266,683) and OP-3 (U.S.S.N 07/971,091), the GDP 
subgroup consisting of GDF-1, GDF-3, and GDF-9, the Inhibin subgroup consisting of Inhibin a, 
Inhibin PA, and Inhibin PB. The dashes (-) indicate a peptide bond between adjacent amino acids. 
A consensus sequence pattern for each subgroup is shown at the bottom of each subgroup. 

30 Figure 6 is a single letter code listing of amino acid sequences, identified in capital letter in 

standard single letter amino acid code, and in lower case letters to identify groups of amino acids 
useful in that location, wherein the lower case letters stand for the amino acids indicated in 
accordance with the pattern definition key table set forth in Figure 10. Figure 6 identifies 
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preferred pattern sequences for constituting the finger I, heel, and finger 2 regions of biosyntbetic 
constructs of the invention. The dashes (-) indicate a peptide bond between adjacent amino acids. 

Figures 7A, 7B, and 7C are sin^e letter code listings of large, medium, and small peptides, 
respectively, comprising 24 (laige), 20 (medium) or 12 (small) amino acids embodying truncated 
S versions of finger 1 regions useful in constructing morphon constructs of the invention. Also 
disclosed in Figures 7A, 7B, and 7C are sequence patterns identifying specific amino add residues 
(upper case letters in accordance with the standard single letter amino acid code) or groups of 
amino add residues (lower case letters in accordance with the prindples described in Figure 10) 
that are useful at particular locations in the finger 1 region or the tnmcated versions thereof. 

10 Figures 7D, 7E, 7F, and 7G are stereo peptide backbone ribbon trace drawings defining the 

whole TGF'Pl finger 1 region, and the large, medium, and small finger 1 pq)tide sequences, 
respectively. The stereo ribbon trace drawings of Figs. 7E, 7F, and 7G correspond to the TGF- 
02 peptide sequences set forth in Figs. 7A, 7B, and 7C, respectively. 

Figures 8 A, 8B, and 8C are single letter code listings of large, medhrni, and small peptides 
15 comprising amino add sequences embodying tnmcated versions of heel regions usefiil in 

constructing morphon constructs of the invention. The "large" sequence comprises as few as 14 
amino acids (e.g., for Inhibin a) and up to 18 (e.g., for Inhibin PA). The "medium" peptide 
comprises between 1 1 and 1 5 amino adds, and the "small" peptide between 6 and 9 amino acids. 
Pattern sequences are given for each subgroup identifying specific amino acid residues (upper 
20 case letters in accordance with the standard single letter amino acid code) or groups of amino acid 
residues (lower case letters in accordance with the principles described in Figure 10) that are 
useful at particular locations in the heel region or the truncated v^ions thereof 

Figures 8D, BE, 8F, and 8G are stereo peptide backbone ribbon trace drawings defining the 
whole TGF-P2 heel re^on, and the large, medium, and small heel peptide sequences, respectively. 
25 The stereo ribbon trace drawings of Figs. 8E, 8F, and 8G correspond to the TGF-32 peptide 
sequences set forth in Figs. 8A, 8B, and 8C, respectively. 

Figures 9A, 9B and 9C are single letter code listings of large, medium, and small peptides, 
respectively, embodying truncated versions of some finger 2 regions useful in constructing 
morphons of the invention. The "large" sequence comprises as few as 16 amino acids (e.g., for 
30 the members of the TGF-3 subgroup) and up to 18 amino acids (e.g., for Dorsalin). The 

"medium" peptide comprises between 12 and 14 amino acids, and the "small" peptide between 8 
and 12 amino acids. Also disclosed are pattern sequences identifying specific amino acid residues 
(upper case letters in accordance with the standard single letter amino acid code) or groups of 
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amino add residues (lower case letters in accordance with the prindples described in Figure 10) 
that are useful at particular locations in the finger 2 region or the tmncated versions thereof. 

Figures 9D, 9E, 9F, and 9G are stereo peptide backbone ribbon trace drawings definmg the 
whole TGF*P2 finger 2 region, and the large, medium, and small finger 2 peptide sequences, 
respectivdy. The stereo ribbon trace drawings of Figs. 9E, 9F, and 9G correspond to the peptide 
sequences set forth in Figs. 9A, 9B, and 9C, respectivdy. 

Figure 10 is a pattern definition table prepared in accordance with the teaching of Smith and 
Smith (1990^ Proc. Natl. Acad. Sd. USA 87:1 18-122. 

Figure 1 1 is a single letter code listing of aligned amino add sequences corresponding to a 
natural OP-I monomer (denoted OP-l/BMP-7) and an OP-1 morphon (denoted M-OPl). For 
convenience, the natural OP-1 and OP-I morphon sequences are shown as separate finger 1, hed, 
and finger 2 regions. The additional linkers connecting these three domains in the OP-1 morphon 
are denoted as linker 1 and linker 2. 

Further particulars concerning the drawings are disclosed in the following description which 
disdoses details of the structure of single chain morphogens and morphons, how to make them, 
how to test thdr activity, how to use them, and provides exemplary constructs. 
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Detailed Description of the Invention 

The instant invention provides a family of single-chain morphon constructs based upon 
members of the TGF-P superfemily that bind prefwentially, either in vivo or in vitro, to a natural 
ceU surface receptor complementary to a TGF-P superfemily member. The single-chain morphon 
5 constructs upon binding to the receptor agonize or mimic the activity of the TGF-p super&mily 
member. 

The invention disclosed herein is based on a combination of insights extrapolated from data 
in the literature and data generated by the inventors and their coworkers. The inventors studied 
the homologies between structures and sequences of currently known members of the TGF-p 
10 superfamily, made observations and comparisons of the activity and structural properties of 
truncated and mutated analogs of certain members of the superfamily, and developed in-dq)th 
knowledge of how to design, make, and use peptide hnkws to produce biologically active single- 
chain proteins by linking naturally separate protein domains. 

Structural Features TGF-B2 and OP*l, 

1 5 The design of the single-chain biosynthetic constructs of the invention has been facilitated by 

modeUng structures based upon the tertiary and quaternary structures of both TGF-P2 and OP-1 . 
Although the amino acid sequences of TGF-P2 and OP-1 exhibit only about 36% amino acid 
sequence identity, when taken together with the fact that TGF-p2 and OP-1 exhibit diflFerent 
physiological activities it is striking that both the tertiary and quaternary structures of these 

20 molecules are substantially the same 

Each of the subunits in either TGF P2 or OP-1 have a characteristic folding pattern, 
illustrated schematically in fig. 1 A, that involves sfac of the seven C-terminai cysteine residues. 
Briefly, four of the cysteine residues in each subunit form two disulfide bonds which together 
create an eight residue ring, while two additional cysteine residues form a disulfide bond that 

25 passes through the ring to form a knot-like structure. With a numbering scheme beginning with 
the most N-terminal cysteine of the 7 conserved cysteine residues assigned number 1, the 2nd and 
6th cysteine residues are disulfide bonded to close one side of the eight residue ring while the 3rd 
and 7th cysteine residues are disulfide bonded to close the other side of the ring. The 1st and 5th 
conserved cysteine residues are disulfide bonded through the center of the ring to form the core of 

30 the knot. Amino acid sequence alignment patterns suggest this structural motif is conserved 
between members of the TGF-P superfamily. The 4th cysteine is semi-conserved and when 
present typically forms an interchain disulfide bond (ICDB) with the corresponding cysteine 
residue in the other subunit. 
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The structure of each subunit in TGF-P2 and OP-1 conqnise three nuyor tertiary structural 
dements and an N-terminal region. The structural dements are made up of regions of contiguous 
polypeptide diain that possess over 50% secondary structure of the following types: (1) loop, 
(2) hdix and (3) p-sheet. Another defining criterion for each structural region is that the entering 
5 (N-tenninal) and exiting (C-terminal) peptide strands are &irly close together, being about 7 A 
apart. 

The amino add sequence between the 1st and 2nd conserved cystemes, as shown in Fig. 1 A, 
form a structural region characterized by an anti-parallel P-sheet finger referred to ho-ein as the 
finger 1 region. Similarly the residues between the 5th and 6th conserved cysteines, as shown in 

10 Fig. I A, also form an anti-parallel p-sheet finger, referred to herein as the finger 2 region. A p- 
sheet finger is a single amino acid chain, comprising a P-strand that folds back on itself by means 
of a P-tum or some larger loop so that the polypeptide chain entering and exiting the region form 
one or more anti-parallel P-sheet structures. The third nu^or structural region, involving the 
residues between the 3rd and 5th conserved cysteines, as shown in Fig. 1 A, is characterized by a 

IS three turn a-helix, referred to herein as the heel region. The organization of the monomer 

structure is similar to that of a left hand where the knot region is located at the position equivalent 
to the palm, the finger 1 region is equivalmt to the index and middle fingers, the a-hdix, or bed 
region, is equivalent to the heel of the hand, and the finger 2 region is equivalent to the ring and 
small fingers. The N-terminal region whose sequence is not conserved across the TGF-P 

20 superfamily is predicted to be located at a position roughly equivalent to the thumb. 

Monovision ribbon tracings of the alpha carbon backbones of each of the three major 
independent struaural elements of the TGF-P2 monomer are ilhistrated in Figures IB- ID. 
Specifically, an exemplary finger 1 region comprising the first anti-paralld p-sheet segment is 
shown in Fig. IB, an exemplary hed region comprising the three turn a-helical segment is shown 
25 in Fig. IC, and an exemplary finger 2 region comprising second and third anti-parallel P-sheet 
segments is shown in Fig. ID. 

Fig. 2 shows stereo ribbon trace drawings of the peptide backbone of the conformationally 
active TGF-P2 dimer complex. The two monomer subunits in the dimer complex are oriented 
symmetrically such that the heel region of one subunit contacts the finger regions of the other 

30 subunit with the knot regions of the connected subunits forming the core of the molecule. The 
4th cysteine forms an interchain disulfide bond with its counterpart on the second chain thereby 
equivalently linking the chains at the center of the palms. The dimer thus formed is an ellipsoidal 
(cigar shaped) molecule when viewed fi'om the top looking down the two-fold axis of symmetry 
between the subunits (Fig. 2A). Viewed fi-om the side, the molecule resembles a bent "cigar" 

35 since the two subunits are oriented at a slight angle relative to each other (Fig. 2B). 
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As shown in Fig. 2, each of the stnictund elements which together define the native 
monomer subunits of the dimcr are labeled 22, 22', 23, 23*, 24, 24', 25, 25', 26, and 26*, wherein, 
elements 22, 23, 24, 25, and 26 are defined by one subunit and elements 22*, 23', 24', 25', and 2ff 
belong to the other subumt Spedfically, 22 and 22* denote N-tenninal domains; 23 and 23' 
5 denote the finger 1 regions; 24 and 24' denote heel r^ons; 25 and 25' denote the finger 2 

regions; and 26 and 26' denote disulfide bonds which connect the 1st and 5th conserved cysteines 
of each subunit to form the knot-like structure. From Fig. 2, it can be seen that the heel region 
from one subunit, e.g., 24, and the finger 1 and finger 2 regions, e.g., 23' and 25', respectively 
from the other subunit, intact with one another. These three elements co-operate with one 
10 other to define a structure interactive with, and complimentary to the ligand binding interactive 
surface of the cognate receptor. 

Single-Chain Biosvnthetic Constructs of the Invention fMorohons^, 

As mentioned above, the morphon constructs differ from the natural TGF-P superfamily 
monbers in that they are single-chain proteins expressed preferably from a single DNA in a host 

1 5 cell. Natural TGF-P superfamily members are dimoic molecules wherein the subunits are held 
together by non-polar interactions and/or one or more inter-chain disulfide bonds. The TGF-P 
sup^amily members are only active in thdr dimeric conformation. In contrast, the morphons are 
active as a monom^ subunit and comprise in the single subunit, two finger regions which 
normally would belong to one subunit of the natural dimer and a heel region which normally 

20 would belong to the other subunit of the natural dimer. It is, therefore, contemplated that the 
biosynthetic constructs of the invention may be more stable, particularly under reducing 
conditions, than the natural morphogen dimers. In addition, it is contemplated that the 
biosynthetic morphon constructs may have a molecular weight substantially lower than natural 
biologically active TGF-p superfamily members, and therefore may diffuse faster, and be cleared 

25 by the body faster than the natural super&miiy members. 

The differences between functionally active TGF-3 superfamily members and morphons are 
shown schematically in Fig. 3. For example. Fig. 3 A shows a dimeric TGF-P superfamily member 
wherein one subunit defining a finger 1 (33), finger 2 (35), heel region (34), and a knot region 
(36) is represented by the completely filled motif and the other subunit defining a finger 1 (33'), 
30 finger 2 (35'), a heel region (34'), and a knot region (36') is represented by the stippled motif 
Figure 3B shows a morphon construct defining a finger 1 (33'), a finger 2 (35'), and a heel region 
(34), however, the finger 1 and finger 2 regions, as defined by the stippled motif, typically are 
derived from one subunit of a dimeric TGF-P superfamily member and the heel region, as defined 
by the completely filled motif, typically is derived from the other subunit of the TGF-P 
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supei&nily member. In the moq)hon construct, the finger and bed regions are connected by two 
polypq>tide linkers (38 and 39). 

The morphon constructs of the invention comprise a finger I region, a heel region and a 
fingo- 2 region, joined together by polypeptide linkers comprising, for example, about 3-13 amino 
S adds, and optionally including an N-terminal sequence upstream of the first cysteine beginning the 
sequence of the first finger region. Where the finger 1 region is designated FI , the finger 2 r^on 
is de^gnated F2, and the hed r^on is designated H, the constructs of this type can take several 
forms and inchide, for «ample: 

Fl -Unker-F2-linker-H; 
10 Fl-hnker-H-linker.F2; 

F2.hnker-Fl-Unker-H: 
F2.1inker-H.Unker-Fl 
H-linker-FMinker-F2; and 
H*linker-F2-linker-Fl . 

15 Fig. 4 shows stereo ribbon trace drawings of the peptide backbone of an exemplary morphon 

construct, wherein Figs. 4A and 4B show top and side views of the morphon, respectively, using 
the same convention as described in Fig. 2. Each of the structural dements of the exemplary 
morphon are labeled 42, 43, 44, 45, 46, 47, 48, and 49. Label 42 denotes the N-terminal domain, 
labd 43 denotes the finger 1 region, label 44 denotes the heel region, label 45 denotes the finger 2 

20 region, labels 46 and 47 denote polypeptide linkers which link the finger and hed r^ons, and 
labded 48 and 49 doiote disulfide bonds. In this figure, the disulfide bond 48 links an N-tenninal 
part of the finger 1 region to an N-tenninal part of the finger 2 region and the disulfide bond 49 
links a C-^terminal part of the finger I region to a C-terminal part of the finger 2 region. 

Without wishing to be bound by theory, it is contemplated that hydrophobic interactions 
25 between the finger and heel elements may stabilize the tertiary structure of each morphon. It is 
contemplated also that engineered disulfide bonds may likewise stabilize the folded structure of 
the morphon construct. It is understood that the skilled artisan need not determine a priori the 
precise locations of the disulfide bonds when designing the constructs because morphogen 
constructs having optimal stability and biological activity may be isolated using conventional 
30 screening methodologies, such as those discussed hereinbelow. 

It is contemplated that the amino acid sequences defining the finger and heel regions may be 
copied fi-om the respective finger and heel region sequences of any known member of the TGF-P 
superfamily. Furthermore, it is contemplated also that the size of the morphon constructs of the 
invention may be reduced significantly by truncating the natural finger and heel regions of the 
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• template TGF-P superfiunily member, while compensating for dimensional changes by using 
polypeptide linko* sequences. Truncated forms of the morphon constnicts are referred to herein 
as "mini morphons". 

I. Selection ofFinger and Heel Regions 

5 As mentioned above, amino add sequences defining finger and heel regions may be copied 

either fiiom the respective finger and heel region sequences of any known TGF-P superfamily 
member, identified herein, or firom amino acid sequences of a new superfamily member discovered 
hereafter. 

Fig. 5 summarizes the amino add sequences of currently identified TGF*3 super&mily 
10 members aligned into finger 1 (fig. 5A), hed (Fig. 5B) and finger 2 (Fig. 5C) regions. The 

sequences were aligned by a computer algorithm which m order to optimally align the sequences 
inserted gaps into r^ons of amino add sequence known to define loop structures rather than 
regions of amino add sequence known to have conserved amino add sequence or secondary 
structure. For example, if possible, no gaps were introduced into amino add sequences of finger 
15 1 and finger 2 regions defined by P sheet or heel regions defined by a helix. The dashes (-) 
indicate a peptide bond between adjacent amino acids. A consensus sequence pattern for each 
subgroup is shown at the bottom of each subgroup. 

After the amino acid sequences of each of the TGF-p superfiunily membm were aligned, the 
aligned sequences were used to produce amino acid sequence alignment patterns which identify 
20 amino acid residues that may be substituted by another amino acid or group of amino acids 
without altering the overall tertiary structure of the morphon construct. The amino adds or 
groups of amino acids that may be useful at a particular position in the finger and hed regions 
were identified by a computer algorithm implementing the amino acid hierarchy pattern structure 
shown in Fig. 10. 

25 Briefly, the algorithm performs four levels of analysis. In level I, the algorithm determines 

whether a particular amino acid residue occurs with a frequency greater than 75% at a specific 
position within the amino acid sequence. For example, if a glycine residue occurs 8 out of 10 
times at a particular position in an amino acid sequence, then a glycine is designated at that 
position. If the position to be tested consists of all gaps then a gap character (-) is assigned to the 

30 position, otherwise, if at least one gap exists then a "z" (standing for any residue or a gap) is 
assigned to the position. If, no amino acid occurs in 75% of the candidate sequences at a 
particular position the algorithm implements the Level II analysis. 
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Levd n defines pattern sets a,b, d,l,k,o,ii,i,aiKih, i»iierein 1, k, and o share a common 
amino add residue. The algorithm then determines whether 75% or more of the amino add 
residues at a particular portion in the amino add sequence satisfy one of the aforementioned 
patterns. Ifso, then the pattern is assigned to that position. It is possible, however, that both 
patterns 1 and k may be simultaneously satisfied because they share the same amino add, 
q>edfically aspartic add. If simultaneous assignment of 1 and k occurs then pattern m (Level m) 
is assigned to that position. Likewise, it is possible that both patterns k and o may be 
simultaneously assigned because they share the same amino add, specifically ghitamic add. If 
simultaneous assignment of k and o occurs, then pattern q (Level lU) is assigned to that position. 
If ndther a Level n pattern nor the Level III patterns, m and q, satisfy a particular position in the 
amino add sequmce then the algorithm implements a Level m ana^s. 

Level HI defines pattern sets c, e, m, q, p, and j, wherdn m, q, and p share a common amino 
residue. Pattern q, however, is not tested in the Level HI analysis. It is possible that both 
patterns m and p may be simultaneously satisfied because they share the same amino acid, 
specifically, glutamic acid. If simultaneous assignment of m and p occurs then pattern r (Levd 
IV) is assigned to that position If 75 % of the amino acids at a pre-selected position in the 
aligned amino add sequences satisfy a Levd HI pattern, then the Level III pattern is assigned to 
that portion. If a Levd ED pattern cannot be assigned to that position then the algorithm 
implements a Levd IV analy^s. 

Level IV comprises two non-overiapping patterns f and r. If 75% of the amino acids at a 
particular position in the amino acid sequence satisfy a Level IV pattern then the pattern is 
assigned to the position. If no Level IV pattern is assigned the algorithm assigns an X 
representing any amino add (Level V) to that position. 

In Fig. 10, Level I lists in upper case letters in single amino acid code the 20 naturally 
occurring amino acids. Levels II-V define, in lower case letters, groups of amino acids based 
upon the amino acid hierarchy set forth in Smith etal., supra. The amino acid sequences set forth 
in Figs. 5 through 9 were aligned using the aforementioned computer algorithms. 

It is contemplated that if the artisan wishes to produce a morphon construct based upon 
currently identified members of the TGF-p superfamily, then the artisan may use the amino acid 
sequences shown in Fig. 5 to provide the finger 1 , finger 2 and heel regions useful in the 
production of the morphon constructs of the invention. In the case of members of the TGF-p 
superfamily discovered hereafter, the amino acid sequence of the new member may be aligned, 
either manually or by means of a computer algorithm, with the sequences set forth in Fig. 5 to 
define heel and finger regions useful in the practice of the invention. 
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Table I bdow iBiinmyriy/^ publications which describe the amino add sequences of each 
TGF-fi super&mily member that woe used to produce the sequence alignment patterns set forth 



in Figs. 5-9. 

Table 1 . 

S TGF-P SEQ. ID.No. Publication 
Super&mify 

Member 

TGF'fi] 1 Denmck et al. (1987) Nucl. Adds. Res. 15:3187 

TGF-32 2 Burt et al. (1 99 1 ) DNA Cell Biol. 10 :723-734 

10 TGF-P3 3 Ten Diike et al. (1988) Proc. Natl. Acad. Sd. USA 85:4 715-4719: 

Derynck rt al. (1988) EMBOJ. 7 :3737-3743. 

TGF-P4 4 Burt ^ al- (1992) Mol. Endcrinol . 6:989-922. 

TGF-P5 5 Kondaiah ^ si. (1990) J. Biol. 0!an265: 1089-1093 

dpp 6 Padgett^s!. (1987) 2^ature325:81-84; Paganibanetai. (1990) 

1 5 Mai. Biol. J0:2669-2677. 

vg-1 7 Weekset^. (1987) Cell 51:861-867 

vgr- 1 8 Lyons et al. (1 989) Eroc. Nat}. Acad . Sd USA 86:4554-4558 

60A 9 Wharton etal (1991) Proc. Natl Acad . Sd. USA 88:9214-9218: 

Doctor Ma!. (1992) D^. Biol. 111:491-505 

20 BMP-2A 10 Woznev et al. (1988) Sdence 242 : 1528-1534 

BMP-3 11 Woznev et al. (1988) Sdence 242 : 1528-1534 

BMP-4 12 Wozney et al. (1 988) Science 242: 1 528- 1 534 

BMP-5 13 Cdeste gt al- (1990) Pro£. Natl Acad Sd. USA 82: 9843-9847 

BMP-6 1 4 Cdeste et al. (1 990) Proc. Natl Acad. Sd. IJSA 87: 9843-9847 

25 Dorsalin 15 Basler et al. (1993) Cell 73:687-702 

OP- 1 16 Cdeste fit al. ( 1 990) Ptqc. Natl. Acad .Sd. USA 87: 9843-9847; 

Ozkaynak et al. (1990) EMBO J. 9:2085-2093 

OP-2 1 7 Ozkaynak et a}. ( 1 992) J. Biol. Chem . 267 : 25220-25227 

OP-3 1 8 Ozkaynak et aL PCT/W094/1 0203 Seq. I.D. No. 1 . 
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19 Lee (1990) Mfil- EndS2Q2QQi- 4: 1034-1040 

20 McPhOTon et ai. (1993) I gid. Chem . 268:3444-3449 

21 McPherron g{ ^. (1993) J. gig!. Chem . 268:3444-3449 

22 Mayo gt al. (1 986) P£SS Ngfl. Acad . Ss. USA 83:5849-5853; 
Stewart al. (1986) FEBSLett 206:329-334; Mason gt fil. (1986) 
Biochem . Biophvs . Res . Cnmtmm 135: 9 57-964 

23 Forage et al. (1986) Proc . Natl . Acad . Sd. USA 52:3091-3095; 
Cbeitov si ^ (1990) Biomed . Sq. 1:499-506 

24 Mason et gl. (1 986) Biochem . Biophvs . Res . Conunun . 135:957- 



In particular, it is contranplated that amino acid sequences defining finger 1 regions usefiil in 
the practice of the instant invention may be copied directly £"001 the amino acid sequence defining 
a finger 1 region for any TGF-P super&mily member identified herein. UsefiU intact fingio- 1 
r^ons inchide, but are not limited to 



15 



20 



25 



30 



TGF-pl 


SEQ. 


ID. No. 


I, residues 2 through 29, 


TGF-P2 


SEQ. 


ID. No. 


2, residues 2 through 29, 


TGF-pB 


SEQ. 


ID. No. 


3, residues 2 through 29, 


TGF-P4 


SEQ. 


ED. No. 


4, residues 2 through 29, 


TGF-P5 


SEQ. 


ID. No. 


5, residues 2 through 29, 


dpp 


SEQ. 


ID. No. 


6, residues 2 through 29, 


Vg-1 


SEQ. 


ID. No. 


7, residues 2 through 29, 


Vgr-1 


SEQ. 


ID. No. 


8, residues 2 throu^ 29, 


60A 


SEQ. 


ID. No. 


9, residues 2 throu^ 29, 


BMP-2A 


SEQ. 


ID. No. 


10, residues 2 through 29, 


BMP-3 


SEQ. 


ID. No. 


1 1 , residues 2 through 29, 


BMP-4 


SEQ. 


ID. No. 


12, residues 2 through 29, 


BMP-5 


SEQ. 


ID. No. 


13, residues 2 through 29, 


BMP-6 


SEQ. 


ID. No. 


14, residues 2 through 29, 


Dorsalin 


SEQ. 


ID. No. 


15, residues 2 through 29, 


OP-1 


SEQ. 


ID. No. 


1 6, residues 2 through 29, 


OP-2 


SEQ. 


ID. No. 


17, residues 2 through 29, 


OP-3 


SEQ. 


ID. No. 


1 8, re»dues 2 through 29, 


GDF-1 


SEQ. 


ID. No 


19, residues 2 through 29, 


GDF-3 


SEQ. 


ID. No. 


20, residues 2 through 29, 
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GDF-9 SEQ. ID. No. 21, readues 2 through 29, 

In!ubm a SEQ. ID. No. 22, residues 2 through 29, 

Inhibin SEQ. ID. No. 23, residues 2 through 29, and 

Inhibtn PB SEQ. ID. No. 24, residues 2 through 29. 

5 It is contemplated also that amino add sequences defining heel regions useful in the practice 

of the instant invention may be copied directly from the amino add sequoice defining an intact 
bed r^on for any TGF-^ superfamily mraiber identified h^n. Usefiil intaa heel r^ons may 
inchide, but are not limited to 





I vFr-p 1 


^PO TT) "Wn 1 r^ciriiipc 1^ thrmiah M 


in 


1 VJF-p^ 








^PO rn 1 n»«irii]e< thrnnah ^7 




1 \ jr*p*r 


0X1'^^. ixj. iNO. *r, rcsiuutSD 3^ uiruugn oz. 






CP/^ m ISJo S r^ctrfaipc fhrraicrh 1^9 
JULr. iNu. icsiuiics J J mruugn u^, 




dpp 


oE\^. 11/. jNu. icssuuucs J J mrougn 




Vg-I 


djc^^. XL/. iNu. iy rcsiQues dd uirougn oj^ 




Vcrr 1 

vgr-1 


11/. r*o. o, Tcsiiiues 33 uirougn 




AHA 
OUA 


ox^V. ill. NO. 7, resiuues 33 uirougn 




BMP-2 


SEQ. ID. No. 10. residues 35 through 64, 




BMP3 


SEQ. ID. No. 1 1, residues 35 through 66, 


20 


BMP-4 


SEQ. ID. No. 12, residues 35 through 64, 




BMP-5 


SEQ. ID. No. 13, readues 35 through 65, 




BMP-6 


SEQ. ID. No. 14, residues 35 through 65, 




Dorsalin 


SEQ. ID. No. 15, residues 35 through 65, 




OP-l 


SEQ. ID. No. 16, residues 35 through 65, 


25 


OP-2 


SEQ. ID. No. 17, residues 35 through 65, 




OP-3 


SEQ. ID. No. 18, residues 35 through 65, 




GDF-1 


SEQ. ID. No. 19, residues 35 through 70, 




GDF-3 


SEQ. ID. No. 20, residues 35 through 64, 




GDF.9 


SEQ. ID. No. 21, residues 35 through 65, 


30 


Inhibin a 


SEQ. ID. No. 22, residues 35 through 65, 




Inhibin PA 


SEQ. ID. No. 23, residues 35 through 69, and 




Inhibin PB 


SEQ. ID. No. 24, residues 35 through 68. 



It is contemplated also that amino acid sequences defining finger 2 regions useful in the 
practice of the instant invention may be copied directly from the amino acid sequence defining an 
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intact finger 2 region for any TGF-p supcrfemily monber identified herein. Usefiil intact finger 2 
regions may include, but are not limited to 





TGF-31 


SEO. ID. No. 1. residues 65 throueh 94 




TGF-P2 


SEO. ID No 2 residues 65 throudi 94 


5 




SEO ID No 3 residues 65 throuoh 94 






SEO ID No 4 residues 65 thrnimh 04 




TCT-PS 


SEO ID No S residues 65 thrauoh 94 




dOD 


SEO ID No 6 restdues 6R thmiioh OR 




Vb-1 


SEO ID No 7 residuM 6S thmnoh Qft 


10 


Ver-l 


SEO ID No 8 residues 6R thmnah OR 




60A 


SEO ID No 9 residues 6R thrnnah OR 




BMP-2A 


SEO ID No 10 refiidiiiK 67 fhrruiah 07 




BMP-3 


SEO ID No 1 1 re5ndi]p^ 60 thrmioh 00 




BMP-4 


SEO ID Nn 1 2 reddnpc f\7 thrrwioh 07 


15 


BMP-S 


SEO ID No 1 1 residiiec ^R thrnnoh OR 




BMP-6 


SRO ID Nn 14 rradn^c ^R thrrknoh OR 




Dorsalin 


■^•Ct\<> Ai^. i^u. icsiiiucs Do uiiuuRn yy. 




OP-1 


•j*--»v^. uur. i^u. lu, icsiuucs Do uirougn 70, 




OP-2 


SEQ. ID. No. 17, readues 68 through 98, 


20 


OP-3 


SEQ. ID. No. 1 8, residues 68 through 98, 




GDF-l 


SEQ. ID. No. 19, residues 73 through 103, 




GDF-3 


SEQ. ID. No. 20, residues 67 dirough 97, 




GDF-9 


SEQ. ID. No. 21, residues 68 through 98, 




Inbibina 


SEQ. ID. No. 22, residues 68 through 101, 


25 


Inhibin PA 


SEQ. ID. No. 23, residues 72 through 102, and 




Inhibit! PB 


SEQ. ID. No. 24, residues 71 through 101. 



Furthermore, it is contemplated that the size of the constructs may be reduced significantly 
by truncating the natural finger and heel regions of the template TGF-P superfamily members, 
while compensating for dimensional change using linkers as disclosed hereinbelow. Preferred 

30 truncated polypeptide sequences that define the finger 1 , heel and finger 2 domains are shown in 
Figs. 7, 8, and 9, respectively. The truncated polypeptides of the finger and heel regions are 
defined as: large peptides (typically about 25 amino acids in length); medium peptides (typically 
about 20 amino acids in length); or small peptides (typically about 16 amino acids in length). 
Each of the aforementioned large, medium, and small finger and heel region peptides preferably 

35 comprise an additional residue(s) i.e., either cys or ser-cys at the N-termini and either cys or cys- 
ser at the C-termini in order to introduce structure stabilizing disulfide bonds into the constructs. 
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Ahhough, the additional N- and C-tmninal serine readues may enhance the water sohibility of the 
constnicts, it is anticipated that polar amino add residues other than serine may likewise be used 
to enhance solubility of the pq>tides. 

Specifically, Fig. 7 shows preferred amino add sequences, in a single letter code, for the 
5 finger 1 large peptide (Fig. 7A), the finger 1 medium peptide (Fig. 7B) and the finger 1 small 
pq)tide (Fig. 7C) for each member of the TGF-P superfamily. Consensus sequence patterns, 
described m detail below, for each subgroup are shown also at the bottom of each subgroup. In 
the patterns, upper case letters identify conserved amino adds in the standard single letter amino 
add code, the upper case letter "X" indicates that any naturally occurring amino add is accqitable 
10 at that particular location. Lower case letters identify groups of amino adds, in accordance with 
the pattern definition table set forth in Fig. 10, that also may be introduced into that position 
without affecting the structure of the resulting morphon construct. 

Fig. 7D shows stereo alpha carbon traces illustrating the whole finger 1 region whereas 
Figs. 7E, 7F, and 7G, show stereo alpha carbon traces of the finger 1 large, medium, and small 

1 S pqjtides, respectively. In Fig. 7D, the lines labded 72, 73, and 74 pass through the alpha carbons 
that are cyclized to form the large, medium, and small peptides of the finger 1 region, 
respectively. The lines with primed labels pass through the cyclized alpha carbons and correspond 
to the similariy numbered unprimed lines in Fig. 7D. It is apparent fi^om Figs. 7D-7G that the 
structural features indicative of the finger 1 region can be maintained while reducing the overall 

20 size of the finger 1 element. 

It is contemplated that large finger 1 peptides useful in the practice of the instant invention 
may be sdected from a finger 1 sequence belonging any one of the TGF-P super&mily members 
identified herein. Large finger 1 peptides usefiil in the practice of the instant invention are set 
forth below and include, for example: 



TGF-P 1 


SEQ. 


ED. No. 


1, residues 5 through 28, 


TGF-32 


SEQ. 


ID. 


No. 


2, residues S through 28, 


TGF-P3 


SEQ. 


ID. 


No. 


3, residues 5 through 28, 


TGF-P4 


SEQ. 


ID. 


No. 


4, residues 5 through 28, 


TGF-P5 


SEQ. 


ID. 


No. 


S, residues 5 through 28, 


dpp 


SEQ. 


ID. 


No. 


6, residues 5 through 28, 


Vg-1 


SEQ. 


ID. 


No. 


7, residues 5 through 28, 


Vgr-1 


SEQ. 


ID. 


No. 


8, residues 5 through 28, 


60A 


SEQ. 


ED. 


No. 


9, residues 5 through 28, 


BMP-2A 


SEQ. 


ID 


No. 


10, residues 5 through 28, 


BMP-3 


SEQ. 


ID. 


No. 


1 1 , residues 5 through 28, 
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10 



15 



20 



25 



30 



BMP-4 SEQ. ID. No. 12, residues 5 through 28. 

BMP-5 SEQ. ID. No. 13, residues 5 through 28, 

BMP-6 SEQ. m. No. 14, residues 5 through 28, 

Dorsalin SEQ. ID. No. 1 5, residues 5 through 28, 

OP-1 SEQ. ID. No. 16, residues 5 through 28, 

OP-2 SEQ. ID. No. 17, residues 5 through 28, 

OP-3 SEQ. ID. No. 1 8, readues 5 through 28. 

CHDF-l SEQ. ID. No. 19, residues 5 through 28. 

CH>F-3 SEQ. ID. No. 20, residues 5 through 28, 

GDF-9 SEQ. ID. No. 2 1 , residues 5 through 28. 

Inhibma SEQ. ID. No. 22, residues 5 through 28, 

Inhiinn SEQ. ID. No. 23, residues 5 through 28, and 

InhSnn PB SEQ. ID. No. 24, residues 5 through 28, 

In addition, it is contemplated that medium finger 1 peptides useful in the practice of the 
instant invention may be selected fi-om a finger 1 sequence belonging any one of the TGF-3 
superfamily m^nbos identified herein. Medium finger I peptides usefiil in the practice of the 
instant invention are set forth below and include, for exanq>le: 



35 



TGF-31 

TGF-P2 

TGF-33 

TGF-P4 

TGF-P5 

dpp 

Vg-l 

Vgr-1 

60A 

BMP-2A 

BMP-3 

BMP-4 

BMP-5 

BMP-6 

Dorsalin 

OP-1 

OP-2 

OP-3 

GDF-1 



SEQ. ID. No. 
SEQ. ID. No. 
SEQ. ID. No. 
SEQ. ID. No. 
SEQ. ID. No. 
SEQ. ID. No. 
SEQ. ID. No. 
SEQ. ID. No. 
SEQ. ED. No. 
SEQ. ID. No 
SEQ. ID. No. 
SEQ. ID. No. 
SEQ. ID. No. 
SEQ. ID. No. 
SEQ. ID. No. 
SEQ. ID. No. 
SEQ. ID. No. 
SEQ. ID. No. 
SEQ. ID. No. 



1, residues 7 through 26, 

2, residues 7 through 26, 

3, residues 7 through 26, 

4, residues 7 through 26, 

5, residues 7 through 26, 

6, residues 7 through 26, 

7, residues 7 through 26, 

8, residues 7 through 26, 

9, residues 7 through 26, 

10, residues 7 through 26, 

1 1 , residues 7 through 26, 

1 2, residues 7 through 26, 

13, residues 7 through 26, 

14, residues 7 through 26, 

15, residues 7 through 26, 

16, residues 7 through 26, 

17, residues 7 through 26. 

18, residues 7 through 26, 

19, residues 7 through 26. 
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GDF-3 SEQ. ID. No. 20, residues 7 through 26, 

Ca>F-9 SEQ. ID. No. 21, residues 7 through 26, 

Inhibin a SEQ. ID. No. 22, residues 7 through 26, 

Inhibin fiA SEQ. ID. No. 23. residues 7 through 26, and 
5 Inhibin pB SEQ. ID. No. 24, residues 7 through 26, 

In addition, it is contemplated that small finger I peptides usefiil in the practice of the instant 
invention may be selected firom a finger 1 sequence belonging any one the TGF-p superfiunily 
members idmtified herem. Small finger 1 peptides useful in the practice of the instant mvention 
are set forth below and incbde, for example: 



10 


TGF-pl 


SEQ. ID. No. 1, residues 1 1 through 22, 




TGF-P2 


SEQ. ID. No. 2, residues 1 1 through 22, 






SEQ. ID. No. 3, residues 11 through 22, 




TGF-P4 


SEQ. ID. No. 4, readues 1 1 through 22, 




TGF-PS 


SEQ. ID. No. S, residues 11 through 22, 


IS 


dpp 


SEQ. ID. No. 6, residues 11 through 22, 




Vg-I 


SEQ. ID. No. 7, residues 11 through 22, 




Vgr-I 


SEQ. ID. No. 8, residues 1 1 through 22, 




60A 


SEQ. ID. No. 9, residues 1 1 through 22, 




BMP-2A 


SEQ. ID. No. 10, residues 1 1 through 22, 


20 


BMP-3 


SEQ. ID. No. 11, residues 1 1 through 22, 




BMP-4 


SEQ. ID. No. 12, residues 1 1 through 22, 




BMP-5 


SEQ. ID. No. 13, readues 1 1 through 22, 




BMP-6 


SEQ. ID. No. 14, residues 1 1 through 22, 




Dorsalin 


SEQ. ID. No. 15, residues 1 1 through 22, 


25 


OP-l 


SEQ. ID. No. 16, residues 1 1 through 22, 




OP-2 


SEQ. ID. No. 17, residues 11 through 22, 




OP-3 


SEQ. ID. No. 18, residues 1 1 through 22, 




GDF-1 


SEQ. ID. No. 19, residues 1 1 through 22, 




GDF-3 


SEQ. ED. No. 20, residues 1 1 through 22, 


30 


GDF-9 


SEQ. ID. No. 21, residues 1 1 through 22, 




Inhibin a 


SEQ. ID. No. 22, residues 1 1 through 22, 




Inhibin PA 


SEQ. ID. No. 23, residues 1 1 through 22, and 




Inhibin PB 


SEQ. ID. No. 24, residues 1 1 through 22. 



Similarly, it is contemplated that the large, medium and small subgroup peptide patterns 
35 shown in Figs. 7A, 7B, and 7C, respectively, for the TGF-p subgroup, the vg/dpp subgroup, the 
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GD¥ subgroup, and the Inhibin subgroup, likewise may be used in the practice of the instant 
invention. 

Fig. 8 shows preferred amino add sequences, in a single letter code, for the heel large 
pq)tide (Fig. 8A), the heel medhmi peptide (Fig. SB) and the heel small peptide (Fig. 8C) for each 
S member of the TGF-p superfamily. Consensus sequence patterns for each subgroup are shown 
also at the bottom of each subgroup. In the patterns, upper case letters identify conserved amino 
add residues in the standard single letter ammo add code, upper case letter "X" indicates that any 
naturally occurring amino add is accq>table at that location, lower case letter ""z" indicates that 
any naturally occurring amino add or a pq)tide bond is accqjtable at that location, and the other 
10 lower case letters identify groups of amino adds defined in accordance with the pattom definition 
table set forth in Fig. 10, that may be introduced at a pre-sdected podtion in the amino add 
sequence without affecting the structure of the resulting morphon construct. 

Fig. 8D shows stweo alpha carbon traces illustrating the whole heel region, whereas Figs. 
8E, 8F, and 8G show stereo alpha carbon traces of the hed region large, medium, and small 

1 5 peptides, respectivdy. In Fig. 8D, the lines labeled 82, 83, and 84 pass through the alpha carbons 
that are cyclized to form the large, medium, and small peptides of the heel region, respectively. 
The lines with primed labels pass through the cyclized alpha carbons and correspond to the 
similariy numbered unprimed lines in Fig. 8D. It is apparent fi-om Figs. 8D-8G that the structural 
features indicative of the heel region can be maintained while reducing the overall size of the heel 

20 element. 

It is contemplated that large heel peptides usefiil in the practice of the instant invention may 
be selected from a heel region sequence belonging any one of the TGF-P superfamily members 
identified herein. Large heel peptides useful in the practice of the instant invention are set forth 
below and include, for example: 



TGF-pl 


SEQ. 


ID. 


No. 


1, residues 46 through 61, 


TGF-32 


SEQ. 


ID. 


No. 


2, residues 46 through 61, 


TGF-P3 


SEQ. 


ID. 


No. 


3, residues 46 through 61, 


TGF-P4 


SEQ. 


ID. 


No. 


4, residues 46 through 61, 


TGF-pS 


SEQ. 


ID. 


No. 


5, residues 46 through 61, 


dpp 


SEQ. 


ID. 


No. 


6, residues 49 through 64, 


Vg-1 


SEQ. 


ID. 


No. 


7, residues 49 through 64, 


Vgr-1 


SEQ. 


ID. 


No. 


8, residues 49 through 64, 


60A 


SEQ. 


ID. 


No. 


9, residues 49 through 64, 


BMP-2A 


SEQ. 


ID. 


No. 


10, residues 49 through 63, 


BMP-3 


SEQ. 


ID. 


No. 


1 1, residues 49 through 65, 
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BMP'4 SEQ. ID. No. 12, residues 49 through 63, 

BMP-5 SEQ. ID. No. 13, residues 49 throu^ 64, 

BMP-6 SEQ. ID. No. 14, residues 49 through 64, 

Dorsalin SEQ. ID. No. 15, readues 49 through 64, 

OP-1 SEQ. ID. No. 16, residues 49 through 64, 

OP-2 SEQ. ID. No. 1 7, readues 49 through 64. 

OP-3 SEQ. ID. No. 1 8, residues 49 through 64, 

GDF-1 SEQ. ID. No. 19, residues 53 through 69, 

GDF-3 SEQ. ID. No. 20, residues 49 through 63, 

GDF-9 SEQ. ID. No. 2 1 , residues 49 through 64, 

Inhibin a SEQ. ID. No. 22, residues 5 1 through 64, 

Inhibin PA SEQ. ID. No. 23, residues 51 through 68, and 
Inhibin SEQ. ID. No. 24, residues 5 1 through 67. 

In addition, it is contemplated that medium heel peptides usefiil in the practice of the instant 
invention may be selected from a hed sequence belonging any one of the TGF-P superfamily 
mnnbers idoitified herein. Medhim heel peptides useful in the practice of the instant invention 
are set forth below and include, for example: 



35 



TGF-pi 

TGF-32 

TGF-P3 

TGF-P4 

TGF-PS 

dpp 

Vg-1 

Vgr-l 

60A 

BMP-2A 

BMP-3 

BMP-4 

BMP-5 

BMP-6 

Dorsalin 

OP-1 

OP-2 

OP-3 

GDF-1 



SEQ. ID. No. 
SEQ. ID. No. 
SEQ. ID. No. 
SEQ. ID. No. 
SEQ. ID. No. 
SEQ. ID. No. 
SEQ. ID. No. 
SEQ. ID. No. 
SEQ. ID. No. 
SEQ. ID. No. 
SEQ. ID. No. 
SEQ. ED. No. 
SEQ ID. No. 
SEQ. ID. No. 
SEQ. ID. No. 
SEQ. ID. No. 
SEQ. ID. No. 
SEQ. ID. No. 
SEQ. ID. No. 



1, residues 

2, residues 

3, residues 

4, residues 

5, residues 

6, readues 

7, residues 

8, residues 

9, residues 

10, residues 

11, residues 

12, residues 

13, residues 

14, residues 

1 5, residues 

16, residues 

17, residues 

18, residues 

19, residues 



49 through 61, 
49 through 61, 
49 through 61, 
49 through 61, 
49 through 61, 
52 through 64, 
52 through 64, 
52 through 64, 
52 through 64, 
52 through 63, 
52 through 65, 
52 through 63, 
52 through 64, 
52 through 64, 
52 through 64, 
52 through 64, 
52 through 64, 
52 through 64, 
56 through 69, 
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GDF-3 SEQ. ID. No. 20, readues 52 through 63, 

GDF-9 SEQ. ID. No. 21, residues 52 through 64, 

Inhibm a SEQ. ID. No. 22, residues 54 through 64, 

Inhibin PA SEQ. ID. No. 23, residues 54 through 68, and 
S Inhibin pS SEQ. ID. No. 24, residues 54 through 67. 

Furthermore, it is contemplated that smaH heel peptides useful in the practice of the instant 
invention may be selected from a hed sequence belonging any one of the TGF-p superfamily 
members identified herdn. Small heel peptides usefid in the practice of the instant invention are 
set fbith below and inchide, for example: 



10 


TGF-pl 


SEO ID No I residues 53 throuch S9 




TGF-P2 


SEO ID No 2 residues 53 through 5Q 




TGF-p3 


SEO ID No 3 residues 53 throuah 5Q 




TGF-p4 


SEO ID No 4 residues 53 through 5Q 




TGF-P5 


SEO ID No 5 residues 53 throuah 5Q 


15 




SEO ID No 6 residues 56 through 




Ve-1 


SEO ID No 7 residues 56 throii&h 6? 






^PO TD Nn S r^iHitAC ^6 fKrraiok 1^0 

O-Cr^^. ixj. iHsj. Oy rcsioues uirougn OiC, 




60A 


SEQ. ID. No. 9, residues 56 through 62, 




BMP-2A 


SEQ. ID. No. 10, residues 56 through 61, 


20 


BMP-3 


SEQ. ID. No. 1 1, residues 56 through 63, 




BMP-4 


SEQ. ID. No. 12, residues 56 through 61, 




BMP-5 


SEQ. ID. No. 13, residues 56 through 62, 




BMP-6 


SEQ. ID. No. 14, residues 56 through 62, 




Dorsalin 


SEQ. ID. No. 15, residues 56 through 62, 


25 


OP-1 


SEQ. ID. No. 16, residues 56 through 62, 




OP-2 


SEQ. ID. No. 17, readues 56 through 62, 




OP-3 


SEQ. ID. No. 18, residues 56 through 62, 




GDF-1 


SEQ. ID. No. 19, residues 60 through 67, 




GDF-3 


SEQ. ID. No. 20, residues 56 through 61, 


30 


GDF-9 


SEQ. ID. No. 21, residues 56 through 62, 




Inhibin a 


SEQ. ID. No. 22, residues 58 through 62, 




Inhibin PA 


SEQ. ID. No. 22, residues 58 through 66, and 




Inhibin PB 


SEQ. ID. No. 23, residues 58 through 65. 



Similarly, it is contemplated that the large, medium and small heel subgroup peptide patterns 
35 shown in Figs. 8A, 8B, and 8C, respectively, for the TGF-p subgroup, the vg/dpp subgroup, the 
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GDF subgroup, and the Inhibin subgroup, also may be used in the practice of the instant 
invention. 

Fig. 9 shows preferred amino acid sequences, in a single amino acid code for the finger 2 
large peptide (Fig. 9A), the finger 2 medium peptide (Fig. 9B) and the finger 2 small peptide (Fig. 
9C) for each member of the TGF-P superfiunily. Consensus sequence patterns for each subgroup 
are shown also at the bottom of each subgroup. In the patterns, upper case letters identify 
conserved amino adds in the standard single letter amino add code, upper case letter "X" 
indicates that any naturally occurring amino add is acceptable at that location, lower case letter 
V indicates that aay naturally occurring amino add or a pc^de bond is acceptable at that 
location, and the other lower case letters identify groups of amino adds, in accordance with the 
pattern definition table set forth in Fig. 10, that may be introduced at a pre-sdected position in the 
amino add sequence without affecting the structure of the resulting morphon construct. 

Fig. 9D shows stereo alpha carbon traces illustrating the whole finger 2 region, whereas 
Figs. 9E, 9F, and 9G show stereo alpha carbon traces of the finger 2 large, medhim, and small 
pq)tides, rcspectivdy. In Fig. 9D, the lines labded 92, 93, and 94 pass through the alpha caibons 
that are cyclized to form the large, medhmi, and small peptides of the finger 2 region, 
respectivdy . The lines with primed labels pass through the cyclized alpha carbons and correspond 
to the similarly numbered unprimed lines in Fig. 9D. It is q)parent fi-om Figs. 9D-9G that the 
structural features indicative of the finger 2 region can be maintained while reducing the overall 
size of the finger 2 element. 

It is contwnplated that large finger 2 peptides usefiil in the practice of the instant invention 
may be selerted fi-om a finger 2 sequence belonging any one of the TGF-P superfemily members 
identified herein. Large finger 2 peptides usefiil in the practice of the instant invention are set 
forth below and include, for example: 



TGF-P I 


SEQ. ED. 


No. 


1, residues 72 through 87, 


TGF-P2 


SEQ. 


ID. 


No. 


2, residues 72 through 87, 


TGF-P3 


SEQ. 


ID. 


No. 


3, residues 72 through 87, 


TGF-P4 


SEQ. 


ID. 


No. 


4, residues 72 through 87, 


TGF-PS 


SEQ. 


ID. 


No. 


5, residues 72 through 87, 


dpp 


SEQ. 


ID. 


No. 


6, residues 75 through 91, 


Vg-1 


SEQ. 


ID. 


No. 


7, residues 75 through 91, 


Vgr-l 


SEQ. 


ID. 


No. 


8, residues 75 through 91, 


60A 


SEQ. 


ID. 


No. 


9, residues 75 through 91, 


BMP-2A 


SEQ. 


ID. 


No. 


1 0, residues 74 through 90, 


BMP-3 


SEQ. 


ID. 


No. 11, residues 76 through 92, 
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BMP-4 SEQ. ID. No. 12, readues 74 through 90. 

BMP-S SEQ. ID. No. 13. residues 75 through 91. 

BMP-6 SEQ. ID. No. 14, residues 75 through 91, 

Dorsalin SEQ. ID. No. 1 5, residues 75 through 92. 

OP-1 SEQ. ID. No. 16, residues 75 through 91 , 

OP-2 SEQ. ID. No. 17, residues 75 through 91 , 

OP-3 SEQ. ID. No. 1 8, readues 75 through 9 1 , 

GDF- 1 SEQ. ID. No. 1 9, residues 80 through 96, 

GDF-3 SEQ. ID. No. 20, residues 74 through 90, 

GDF-9 SEQ. ID. No. 21, residues 75 through 91, 

Inhibina SEQ. ID. No. 22, residues 77 through 94, 

Inhibin 3A SEQ. ID. No. 23, residues 79 through 95, and 

Inhibin SEQ. ID. No. 24, residues 78 throu^ 94. 



In addition, it is contemplated that medium finger 2 peptides usefiil in the practice of the 
15 instant invention may be selected from a finger 2 sequence belonging any one of the TGF-P 
superfamily members identified herein. Medium finger 2 peptides usefiil in the practice of the 
instant invention are set forth below and include, for example: 





TGF-pl 


SEQ ID. 


No. 


1, residues 74 through 85, 




TGF-P2 


SEQ. ID. 


No. 


2, residues 74 through 85, 


20 


TGF-P3 


SEQ. ID. 


No. 


3, residues 74 through 85, 




TGF-P4 


SEQ. ID. 


No. 


4, residues 74 through 85, 




TGF-P5 


SEQ. ID. 


No. 


5, residues 74 through 85, 




dpp 


SEQ. ID. 


No. 


6, residues 77 through 89, 




Vg-1 


SEQ. ID. 


No. 


7, residues 77 through 89, 


25 


Vgr-1 


SEQ. ID. 


No. 


8, residues 77 through 89, 




60A 


SEQ. ID. 


No. 


9, residues 77 through 89, 




BMP-2A 


SEQ. ED 


No. 


10, residues 76 through 88, 




BMP-3 


SEQ. ID. 


No. 


1 1 , residues 78 through 90, 




BMP-4 


SEQ ID. 


No. 


12, residues 76 through 88, 


30 


BMP-5 


SEQ. ID. 


No. 


13, residues 77 through 89, 




BMP-6 


SEQ. ID. 


No. 


14, residues 77 through 89, 




Dorsalin 


SEQ. ID. 


No. 


15, residues 77 through 90, 




OP-1 


SEQ. ID. 


No. 


16, residues 77 through 89, 




OP-2 


SEQ. ID. 


No. 


17, residues 77 through 89, 


35 


OP-3 


SEQ. ID. 


No. 


18, residues 77 through 89, 




GDF-1 


SEQ. ID. 


No. 


19, residues 82 through 94, 
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CH>F-3 SEQ. ID, No. 20, residues 76 throu^ 88, 

GDF-9 SEQ. ID. No. 21, residues 77 through 89, 

lohibin a SEQ. ID. No. 22, residues 79 through 92, 

Inhibin SEQ. ID. No. 23, residues 8 1 through 93, and 

5 Inhibin SEQ. ID. No. 24, residues 80 through 92. 

Furthermore, it is contemplated that small finger 2 pq)tides useful in the practice of the 
instant invention may be selected firom a finger 2 sequence belonging any one of the TGF-p 
super&mily mmibers identified herein. Small finger 2 peptides usefiil in the practice of the instant 
invention are set forth below and include, for example: 



10 


TGF-P 1 


SEO ID No 1 residupQ 76 thrnnoh SI 




TGF-p2 


SEO ID No 2 m«frhiM 7ft thmnak ft"^ 




TGF-p3 


SEO ID No 3 reslHiipc 76 ttimnah k"^ 




TGF-P4 






TGF-P5 


SEO ID No 5 resirhie^ 76 thrnnah ft'^ 


15 


doD 


SEO ID No 6 residues 7Q thrnnoH R7 






SEO ID No 7 residues 70 ttimnvh R7 




Ver-1 


SFO rn Nn R mctrfitM 70 ftimitnk fiT 

j-L^. i^u. o, rcsiuucs iy mrougn o /, 




60A 


SEQ. ID. No. 9, residues 79 through 87, 




BMP.2A 


SEQ. ID. No. 10, residues 78 through 86, 


20 


BMP.3 


SEQ. ID. No. 1 1, residues 80 through 88, 




BMP^ 


SEQ. ID. No. 12, residues 78 through 86, 




BMP-5 


SEQ. ID. No. 13, residues 79 through 87, 




BMP.6 


SEQ. ID. No. 14, residues 79 through 87, 




Dorsalin 


SEQ. ID. No. 15, residues 79 through 88, 


25 


OP-1 


SEQ. ID. No. 16, residues 79 through 87, 




OP.2 


SEQ. ID. No. 17, residues 79 through 87, 




OP-3 


SEQ. ID. No. 18, residues 79 through 87, 




GDF-l 


SEQ. ID. No. 19, residues 84 through 92, 




GDF.3 


SEQ. ID. No. 20, residues 78 through 86, 


30 


GDF-9 


SEQ. ID. No. 21, residues 79 through 87, 




Inhibin a 


SEQ. ID. No. 22, residues 81 through 90, 




Inhibin 3A 


SEQ. ID. No. 23, residues 83 through 91, and 




Inhibin PB 


SEQ. ID. No. 24, residues 82 through 90. 



Similarly, it is contemplated that the large, medium and small finger 2 subgroup peptide 
35 patterns shown in Figs. 9 A, 9B, and 9C, respectively, for the TGF-P subgroup, the vg/dpp 
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subgroup, the GDF subgroup, and the Inhibin subgroup, also may be used in the practice of the 
instant invention. 

In addition, it is contemplated that the amino acid sequences of the respective finger and heel 
regions may be altered by amino acid substitution, especially by exploiting substitute residues 
5 sdected in accordance with the principles disclosed in Smith et al. ( 1 990), supra . Briefly, Smith 
^ al. disclose an amino acid dass hierarchy similar to the one summarized in Fig. 1 0, vAnch may 
be used to rationally substitute one amino acid for another while minimizing gross conformational 
distortions of the type i^ch would inactivate protein function. In any event, it is contemplated 
that many synthetic first finger, second fingo*, and heel region sequences, having only 70% 
10 homology with natural regions, preferably 80%, and most preferably at least 90%, can be used to 
produce morphon constructs. 

Amino acid sequence patterns showing amino acids preferred at each location in the finger 
and heel regions, deduced in accordance with the principles described in Smith e£ aL (1 990) supra, 
also are show in Figs. 5-9, and are referred to as the: TGF-P; Vg/dpp; GDF; and Inhibin subgroup 
1 S patterns. The amino acid sequences defining the finger 1 , heel and finger 2 sequence patterns of 
each subgroup are set forth in Figs. 5 A, 5B, and 5C, respectively. In addition, the amino acid 
sequences defining the entire TGF-P, Vg/dpp, GDF and Inhibin subgroup patterns are set forth in 
the Sequence Listing as SEQ. ID. Nos. 25, 26, 27, and 28, respectively. 

The preferred amino acid sequence patterns for each subgroup, disclosed in Figures 5A, 5B, 
20 and 5C, and summarized in Figure 6, enable one skilled in the art to identify alternative amino 
acids that may be incorporated at specific positions in the finger 1, heel, and finger 2 elements. 
The amino adds identified in upper case letters in a ^ngle letter amino acid code identify 
conserved amino acids that together are believed to define structural and functional elements of 
the finger and heel regions. The upper case letter "X" in Figs. 5 and 6 indicates that any naturally 
25 occurring amino acid is acceptable at that position. The lower case letter "z" in Figs. 5 and 6 
indicates that either a gap or any of the naturally occurring amino acids is acceptable at that 
position. The lower case letters stand for the amino acids indicated in accordance with the pattern 
definition table set forth in Figure 10 and identify groups of amino acids which are useful in that 
location. 

30 In accordance with the amino acid sequence subgroup patterns set forth in Figs. 5-9, it is 

contemplated, for example, that the skilled artisan may be able to predict that where applicable, 
one amino acid may be substituted by another without inducing disruptive stereochemical changes 
within the resulting protein construct. For example, in Fig 5A, in the TGF-P subgroup pattern at 
residue number 12 it is contemplated that either a lysine residue (K) or a glutamine residue (Q) 

35 may be present at this position without affecting the structure of the resulting construct. 
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Accordingly, the sequence pattern at position 12 contains an "n" which in accordance with Figure 
10 defines an amino add residue selected from the group consisting of lysine or glutamine. It is 
contemplated, therefore, that many synthetic finger 1, finger 2 and heel r^on amino add 
sequences, having 70% homology, preferably 80%, and most preferably at least 90% with the 
5 natural regions, may be used to produce conformationally active constructs of the invention. 

In accordance with these prindples, it is contemplated that one may design a synthetic 
morphon construct by starting with the amino add sequence patterns belonging to the TGF-P, 
Vg/dpp, GDP, or Inhibin subgroup patterns shown in Figs. 5 and 6. Thereafter, by using 
conventional recombinant DNA methodologies a presdected amino add may be substituted by 
10 another as guided by the prindples herein and the resulting protdn construct tested for bindii^ 
activity in combination with either agonist or antagonist activity. 

The TGF-p subgroup pattern, SEQ. ID. No. 25, accommodates the homologies shared 
among members of the TGF-P subgroup identified to date inchiding TGF-31, TGF-p2, TGF-P3, 
TGF-P4, and TGF-pS. The generic sequence, shown bdow, inchides both the conservred amino 
IS adds (standard three letter code) as well as akemative amino adds (Xaa) present at the variable 
positions within the sequence and defined by the rules set forth in Fig. 10. 

TGF-B Subgroup Pattern 

Cys Cys Val Arg Pro Leu Tyr lie Asp Phe Arg Xaa Asp Leu Gly Trp 
15 10 15 

Lys Trp He His Glu Pro Lys Gly Tyr Xaa Ala Asn Phe Cys Xaa Gly 

20 25 30 

Xaa Cys Pro Tyr Xaa Trp Ser Xaa Asp Thr Gin Xaa Ser Xaa Val Leu 
35 40 45 

Xaa Leu Tyr Asn Xaa Xaa Asn Pro Xaa Ala Ser Ala Xaa Pro Cys Cys 
50 55 60 

Val Pro Gin Xaa Leu Glu Pro Leu Xaa He Xaa Tyr Tyr Val Gly Arg 
65 70 75 80 

Xaa Xaa Lys Val Glu Gin Leu Ser Asn Met Xaa Val Xaa Ser Cys Lys 
85 90 95 

Cys Ser. 

Each Xaa can be independently selected fi-om a group of one or more specified amino acids 
defined as follows, wherein: Xaal2 is Arg or Lys; Xaa26 is Ala, Arg, Asn, Asp, Cys, Glu, Gin, 
Gly, His, He, Leu, Lys, Met, Phe, Pro, Ser, Thr, Trp, Tyr or Val; Xaa3 1 is Ala, Arg, Asn, Asp, 
Cys, Glu, Gin, Gly, His, Ile,Leu, Lys, Met, Phe, Pro, Ser, Thr, Trp, Tyr or Val; Xaa33 is Ala, Gly, 
5 Pro, Ser, or Thr; Xaa37 is lie. Leu, Met or Val; Xaa40 isAla, Arg, Asn, Asp, Cys, Glu, Gin, Gly, 
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His, He, Leu, Lys, Met, Phe, Pro, Ser, Thr, Trp, Tyr or Val; Xaa44 is His, Phe, Trp or Tyr; 
Xaa46 is Arg or Lys; Xaa49 is Ala, Giy, Pro, Ser, or Thr, XaaSB is Arg, Asn, Asp, dn, Qu, His, 
Lys, Scr or Thr, XaaS4 is Ala, Aig, Asn, Asp, Cys, (Hu, Gin, Gly, His, He, Leu, Lys, Met, Phe, 
Pro, Ser, Thr, Trp, Tyr or Val; XaaS7 is Ala, Arg, Asn, Asp, Cys, Glu, Gn, Gly, His, He, Leu, 
5 Lys, Met, Phe, Pro, Ser, Thr, Trp, Tyr or Val; Xaa61 is Ala, Gly, Pro, Ser. or Thr, Xaa68 is Ala, 
Arg, Asn, Asp, Cys, Glu, Gte, Gly, His, He, Leu, Lys, Met, Phe, Pro, Ser, Thr, Trp, Tyr or Val; 
Xaa73 is Ala, Gly, Pro, Ser, or Thr, Xaa75 is lie. Leu, Met or Val; XaaSl is Arg, Asn, Asp, Gin, 
Ghi, His, Lys, Ser or Thr, Xaa82 is Ala, Gly, Pro, Ser, or Thr; Xaa91 is De or Val; Xaa93 is Arg 
or Lys. 

10 The Vg/dpp subgroup pattern, SEQ. ID. No. 26, accommodates the homologies shared 

among members of the Vg/dpp subgroup idoitified to date induding dpp, vg-1, vgr-1, 60A, 
BMP-2A(BMP-2), Dorsalin, BMP-2B (BMP-4), BMP-3, BMP-S, BMP.6, OP-1 (BMP-7). OP.2 
and OP-3. The generic sequence, below, includes both the conserved amino acids (standard three 
letter code) as well as alternative amino acids (Xaa) present at the variable positions within the 

1 5 sequence and defined by the rules set forth in Fig. 1 0. 

Vgg/dpp Subgroup Pattern 



Cys Xaa Xaa Xaa Xaa Leu Tyr Val Xaa Phe Xaa Asp Xaa Gly Trp Xaa 
15 10 15 



Asp Trp lie lie Ala Pro Xaa Gly Tyr Xaa Ala Xaa Tyr Cys Xaa Gly 
20 25 30 



Xaa Cys Xaa Phe Pro Leu Xaa Xaa Xaa Xaa Asn Xaa Thr Asn His Ala 
35 40 45 



lie Xaa Gin Thr Leu Val Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Pro 
50 55 60 



Lys Xaa Cys Cys Xaa Pro Thr Xaa Leu Xaa Ala Xaa Ser Xaa Leu Tyr 

65 70 75 BO 



Xaa Asp Xaa Xaa Xaa Xaa Xaa Val Xaa Leu Xaa Xaa Tyr Xaa Xaa Met 
85 90 95 



Xaa Val Xaa Xaa Cys Gly Cys Xaa. 
100 



Each Xaa can be independently selected fi'om a group of one or more specified amino acids 
defined as follows, wherein: Xaa? is Arg or Lys; Xaa3 is Arg or Lys; Xaa4 is Arg, Asn, Asp, Gin, 
Glu, His, Lys, Ser or Thr; Xaa5 is Arg, Asn, Asp, Gin, Glu, His, Lys, Ser or Thr; Xaa9 is Arg, 
Asn, Asp, Gin, Glu, His, Lys, Ser or Thr; Xaal 1 is Arg, Asn, Asp, Gin, Glu, His, Lys, Ser or Thr; 
5 Xaal3 is He, Leu, Met or Val; Xaal 6 is Arg, Asn, Asp, Gin, Glu, His, Lys, Ser or Thr, Xaa23 is 
Arg, Gin, Glu,or Lys; Xaa26 is Ala, Arg, Asn, Asp, Cys, Glu, Gin, Gly, His, He, Leu, Lys, Mel, 
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Phe, Pro. Scr. Thr, Trp, Tyr or Val; Xaa28 is Phe, Trp or Tyr, Xaa3 1 is Arg, Asn, Asp, Gin. Ghi, 
His, Lys, Ser or Thr, Xaa33 Is Asp or Ghi; Xaa35 is Ala, Aig, Asn, Asp, Cys, Glu, Gin, Gly. His. 
He, Leu, Lys, Met, Phe, Pro, Scr, Tlir, Trp, Tyr or Val; Xaa39 is Ala, Arg, Asn, Asp, Cys, Ghi, 
Ghi. Gly, His, De, Leu, Lys, Met, Phe, Pro, Ser, Thr, Trp, Tyr or Val; Xaa40 is Ala, Arg, Asn, 
5 Asp, Cys, Glu, Ghi, Gly, His, De, Leu, Lys, Met, Phe, Pro, Ser, Thr, Trp, Tyr or Val; Xaa4 1 is 
Ala, Arg, Asn, Asp, Cys, Glu, Ghi, Gly, His, De, Leu, Lys, Met, Phe, Pro, Ser, Thr, Tip, Tyr or 
Val; Xaa42 is Leu or Met; Xaa44 is Ala, Gly, Pro, Ser, or Thr, XaaSO is De or Val; Xaa55 is Aig, 
Asn, Asp, Ghi, Glu, His, Lys, Ser or Thr, Xaa56 is Ala, Arg. Asn, Asp, Cys, Ghi, Gin, Gly. Htts. 
De, Leu, Lys, Met, Phe, Pro, Ser, Thr, Trp, Tyr or Val; Xaa57 is lie, Uu, Met or Val; Xaa58 is 

10 Aig, Asn, Asp, Ghi, Ghi, IBs, Lys, Ser or Thr, Xaa59 is Ala, Arg, Asn, Asp, Cys, Glu, Gta, Gly, 
His, De, Leu, Lys, Met, Phe, Pro, Ser, Thr, Trp, Tyr, Val or a peptide bond; Xaa60 is Ala, Arg, 
Asn, Asp, Cys, Glu, Ghi, CHy, His, De, Leu, Lys. Met, Phe, Pro, Ser, Thr, Tip, Tyr, Val or a 
peptide bond; Xaa61 is Aig, Asn, Asp, Ghi, Glu, His, Lys, Ser or Tlir; Xaa62 is Ala, Arg, Asn, 
A^, Cys, Ghi, Ghi, Gly, Ks, De, Leu, Lys, Met, Phe, Pro, Ser, Thr, Trp, Tyr or Val; Xaa63 is De 

1 5 or Val; Xaa66 is Ate, Gly, Pro, Ser, or Thr, Xaa69 is Ala, Arg, Asn, Asp, Cys, Ghi, Ghi, Gly, His, 
De, Loi, Lys, Met, Phe, Pro, Ser, Thr, Trp, Tyr or Val; Xaa72 is Arg, Ghi, Ghi,or Lys; Xaa74 is 
Arg, Asn. Asp, Gta, Glu, Ks, Lys, Ser or Thr; Xaa76 is De or Val; Xaa78 is De, Leu, Met or Val; 
XaaSl is Cys, De, Leu, Met, Phe, Trp, Tyr or Val; Xaa83 is Asn, Asp or Ghi; Xaa84 is Arg. Asn, 
Asp, Gta, Glu, His, Lys, Ser or Thr; Xaa85 is Ala, Arg. Asn, Asp, Cys, Ghi, Gta, Gly, His, De, 

20 Leu, Lys. Met, Phe, Pro, Ser, Thr. Trp, Tyr, Val or a peptide bond; Xaa86 is Arg. Asn, Asp, Gta, 
Glu, His, Lys, Ser or Thr; Xaa87 is Arg, Asn, Asp, Gta, Glu, His, Lys, Ser or Thr; Xaa89 is De or 
Val; Xaa91 is Arg or Lys; Xaa92 is Arg, Asn, Asp, Gta, Glu, His, Lys, Ser or Thr, Xaa94 is Arg, 
Gta, Glu,or Lys; Xaa9S is Asn or A^; Xaa97 is Ala, Arg. Asn, Asp, Cys, Glu, Qn, Gly, His, De, 
Leu, Lys, Met, Phe, Pro, Scr, Thr, Trp, Tyr or Val; Xaa99 is Arg, Gta, Glu, or Lys; Xaal 00 is 

25 Ala, Gly, Pro, Ser, or Thr, Xaal04 is Arg, Asn, Asp, Qn, Glu, Ks, Lys, Ser or Thr. 

The GDF subgroup pattern, SEQ. ID. No. 27, accommodates the homologies shared among 
members of the GDF subgroup identified to date including GDF-1, GDF-3, and GDF-9. The 
generic sequence, shown below, inctades both the conserved amino acids (standard three letter 
code) as well as alternative amino acids (Xaa) present at the variable positions within the 
30 sequence and defined by the rules set forth in Fig. 10. 

GDF Subgroup Pattern 

Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Phe Xaa Xaa Xaa Xaa Trp Xaa 
15 10 as 

Xaa Trp Xaa Xaa Ala Pro Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys Xaa Gly 
20 25 30 

Xaa Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
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35 40 45 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

50 55 60 

Pro Xaa Xaa Xaa Xaa Xaa Xaa Cys Val Pro Xaa Xaa Xaa Ser Pro Xaa 
65 70 75 80 

Ser Xaa Leu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Tyr 
B5 90 95 

Glu Asp Met Xaa Xaa Xaa Xaa Cys Xaa Cys Xaa. 
100 105 



Each Xaa can be iiidq)endently selected from a group of one or more spedfied amino adds 
defined as follows, wherein: Xaa2 is Arg, Asn, Asp, GHn, Glu, His, Lys, Ser or Thr, Xaa3 is Ala, 
Aig, Asn, Asp, Cys. Glu, Gin, Giy, His, De, Leu, Lys. Met, Phe, Pro, Set, Thr, Trp, Tyr or Val; 
Xaa4 is Arg, Asn, Asp, Ghi, Giu, His, Lys, Ser or Thr, Xaa5 is Arg, Asn, Asp, CHn, Gfai, His, Lys, 
5 Ser or Thr, Xaa6 is Cys, He, Leu, Met, Phe, Trp, Tyr or Val; Xaa7 is Ala, Aig, Asn, Asp, Cys, 
Ghi, CHn, Gly, His, He. Leu, Lys. Met, Phe, Pro, Ser, Thr, Trp, Tyr or Val; XaaS is He, Leu, Met 
or Val; Xaa9 is Arg, Asn, Asp, Gin, Glu, His, Lys, Ser or Thr, Xaal 1 is Arg, Asn, Asp, Gin, Glu, 
His, Lys, Ser or Thr; Xaal2 is Arg, Asn, Asp, Gbi, Glu, His, Lys, Ser or Thr, Xaal3 is De, Leu, 
Met or Val; Xaal4 is Ala, Arg, Asn, Asp, Cys, Glu, Gin, Gly, His, De, Leu, Lys, Met, Phe, Pro, 

10 Ser, Thr, Trp. Tyr or Val; Xaal6 is Arg, Asn, Asp, Gin, Glu. His, Lys. Ser or Thr, Xaal7 is Arg, 
Asn, Asp, Gin, Glu, His. Lys, Ser or Thr; Xaal9 is lie or Val; Xaa20 is He or Val; Xaa23 is Arg, 
Asn, Asp, Gin, Glu, His, Lys, Ser or Thr; Xaa24 is Ala, Arg, Asn, Asp. Cys. Glu, Gin, Gly. His, 
De, Leu, Lys, Met, Phe, Pro, Ser, Thr, Trp, Tyr or Val; Xaa25 is Phe, Trp or Tyr, Xaa26 is Ala, 
Arg, Asn, Asp, Cys. Glu, Gin, Gly, His, Ue. Leu, Lys, Met, Phe. Pro, Ser. Thr, Trp, Tyr or Val; 

15 Xaa27 is Ala, Giy, Pro, Ser, or Thr, Xaa28 is Arg, Asn, Asp, Gin, Glu, His, Lys, Ser or Thr, 

Xaa29 is Phe, Trp or Tyr; Xaa3 1 is Arg, Asn, Asp, Gin, Glu, His, Lys, Ser or Thr, Xaa33 is Arg, 
Asn. Asp. Gin, Glu, His, Lys, Ser or Thr; Xaa35 is Ala, Gly, Pro, Ser, or Thr; Xaa36 is Ala, Arg, 
Asn, Asp, Cys, Giu, Gin, Gly, His, lie, Leu, Lys, Met, Phe. Pro, Ser, Thr, Trp, Tyr or Val; Xaa37 
is Ala, Gly, Pro, Ser, or Thr; Xaa38 is Ala, Arg, Asn, Asp. Cys, Glu, Gin, Giy, His, lie. Leu, Lys, 

20 Met, Phe. Pro. Ser, Thr, Trp, Tyr or Val; Xaa39 is Arg, Asn, Asp, Gin, Glu, His. Lys. Ser or Thr; 
Xaa40 is Ala, Arg, Asn. Asp, Cys, Glu, Gin, Gly, His, He, Leu, Lys. Met. Phe, Pro, Ser. Thr, Trp, 
Tyr or Val; Xaa41 is Ala. Arg, Asn, Asp, Cys. Glu, Gin, Giy, His, He, Leu, Lys, Met, Phe, Pro, 
Ser, Thr, Trp, Tyr or Val; Xaa42 is Ala, Arg, Asn, Asp, Cys. Giu, Gin, Gly, His, He, Leu, Lys. 
Met. Phe, Pro. Ser, Thr, Trp, Tyr or Val; Xaa43 is Ala, Arg, Asn, Asp, Cys, Glu, Gin, Gly, His, 

25 He, Leu, Lys, Met, Phe, Pro, Ser, Thr, Trp, Tyr, Val or a peptide bond; Xaa44 is Ala, Arg. Asn. 
Asp, Cys, Glu, Gin, Gly, His, Ue, Leu, Lys, Met, Phe, Pro, Ser, Thr, Trp, Tyr, Val or a peptide 
bond; Xaa45 is Ala. Arg, Asn, Asp, Cys, Glu, Gin, Gly, His, He, Leu, Lys, Met, Phe, Pro, Ser, 
Thr. Trp. Tyr, Val or a peptide bond; Xaa46 is Ala, Arg. Asn, Asp, Cys, Glu. Gin. Giy, His. He, 
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Leu, Lys. Met, Phe, Pro, Ser, Thr. Tip. Tyr, Val or a peptide bond; Xaa47 is Ala. Arg, Asa, Asp, 
Cys. Ghi, Gin, Gly, His. He, Leu, Lys, Met, Phe, Pro, Ser, Thr, Trp, Tyr or Val; Xaa4g is Ala, 
Giy, Pro, Ser. or Thr, Xaa49 is Ala, Aig, Asn, Asp, Cys. Ghi, Gta, Qy, His, De. Leu, Lys. Met. 
Phe, Pro, Ser, Thr, Trp. Tyr or Val; Xaa50 is Ala, Arg, Asn, Asp, Cys, Glu, Gin, Gly. His, De, 
5 Leu, Lys, Met, Phe. Pro. Ser. Thr. Trp. Tyr or Val; Xaa5 1 is Ks. Phe. Tip or Tyr. XaaSl is Ala, 
Gly, Pro. Ser. or Thr; Xaa53 is Cys, He, Leu, Met, Phe, Trp, Tyr or Val; Xaa54 is De, Leu. Met 
or Val; Xaa55 is Aig, Gta. Ghi.or Lys; Xaa56 is Ala, Arg. Asn, Asp, Cys, Ghi, Gin, Gly, His. De, 
Leu, Lys, Met, Phe. Pro. Ser. Thr, Trp, Tyr or Val; Xaa57 is De, Leu, Met or Val; Xaa58 is De. 
Leu, Met or Val; Xaa59 is His, Phe, Trp or Tyr. Xaa60 is Ala, Arg. Asn, Asp. Cys. Glu, Gta, Gly, 
10 His. De, Leu. Lys. Met, Phe, Pro, Ser, Thr. Trp, Tyr or Val; Xaa61 is Ala. Arg, Asn, Asp, Cys, 
Giu. Gta, Gly, His, De, Le^ Lys, Met. Phe, Pro. Ser. Thr. Trp, Tyr or Val; Xaa62 is Ala, Arg, 
Asn. Asp. Cys. Gta. Gta, Gly. His. De. Leu. Lys. Met, Phe. Pro. Ser. Thr. Tip. Tyr or Val; Xaa63 
is Ala, Arg, Asn, Asp. Cys, Glu. Gta, Gly, His, De, Leu, Lys. Met. Phe. Pro, Ser. Thr. Trp. Tyr. 
Val or a peptide bond; Xaa64 is Ala, Arg. Asn, Asp, Cys. Ghi. Gta. Gly. His. De. Leu. Lys. Met. 
1 5 Phe. Pro. Ser, Thr, Tip, Tyr, Val or a peptide bond; Xaa66 is Ala, Arg, Asn, Asp, Cys, Gta. Gta, 
Gly. His, De, Leu, Lys, Met. Phe, Pro, Ser. Thr. Trp. Tyr or Val; Xaa67 is Ala. Arg. Asn. Asp. 
Cys, Glu, Gta, Gly, His, De, Leu, Lys. Met. Phe, Pro. Ser. Thr, Trp, Tyr or Val; Xaa68 is Ala. 
Gly, Pro, Ser, or Thr. Xaa69 is Arg, Asn. Asp, Gta. Glu. His, Lys. Ser or Thr; Xaa70 is Ala. Gly, 
Pro, Ser, or Thr; Xaa71 is Ala, Arg. Asn, Asp, Cys, Gta, Gin, Gly, His, De. Leu, Lys, Met, Phe, 
20 Pro, Ser, Thr, Trp, Tyr or Val; Xaa75 is Ala, Arg, Asn, Asp, Cys. Gta, Gin. Gly, His, De. Leu. 
Lys, Met. Phe. Pro. Ser. Thr, Trp, Tyr or Val, Xaa76 is Arg or Lys; Xaa77 is Cys, De, Leu, Met. 
Phe. Trp. Tyr or Val; XaaSO is De. Leu. Met or Val; Xaa82 is De. Leu, Met or Val; Xaa84 is Ala, 
Arg, Asn, Asp. Cys. Gta, Gta, Gly, His, De. Leu. Lys. Met. Phe. Pro, Ser. Thr. Trp. Tyr or Val; 
Xaa85 is Ala, Aig, Asn. Asp. Cys. Glu. Gin. Gly, His. De, Leu, Lys, Met, Phe. Pro. Ser. Thr. Trp. 
25 Tyr or Val; Xaa86 is Asp or Gta; Xaa87 is Ala. Aig. Asn. Asp. Cys. Glu. Gin. Gly, His. De. Leu. 
Lys, Met, Phe. Pro, Ser. Thr, Trp, Tyr or Val; Xaa88 is Arg, Asn. Asp, Gin, Gta. His, Lys. Ser or 
Thr; Xaa89 is Ala, Arg, Asn, Asp, Cys, Gta, Gin. Gly, His, He, Leu, Lys, Met, Phe, Pro, Ser, Thr, 
Trp, Tyr or Val; Xaa90 is Arg, Asn, Asp, Gin, Glu, His, Lys, Ser or Thr; Xaa91 is He or Val; 
Xaa92 is Ala, Arg, Asn, Asp, Cys, Glu, Gin, Gly, His, He, Leu, Lys, Met, Phe, Pro, Ser, Thr, Trp, 
30 Tyr or Val; Xaa93 is Cys, He, Leu, Met. Phe, Trp, Tyr or Val; Xaa94 is Arg or Lys; Xaa95 is 
Arg, Asn, Asp, Gin, Glu, His. Lys. Ser or Thr; XaalOO is He or Val; XaalOl is Ala, Arg, Asn, 
Asp, Cys, Gta. Gta, Gly, His. De, Leu, Lys, Met, Phe, Pro, Ser, Thr, Trp, Tyr or Val; Xaal02 is 
Arg, Asn, Asp, Gin, Glu, His. Lys. Ser or Thr; Xaal03 is Arg. Gin, Glu.or Lys; XaalOS is Ala. 
Gly, Pro, Ser, or Thr; Xaal07 is Ala, Arg, Asn, Asp, Cys, Glu. Gin, Gly, His, De, Uu, Lys, Met, 
35 Phe, Pro. Ser, Thr. Trp. Tyr or Val. 

The Inhibin subgroup pattern, SEQ. ID. No. 28, accommodates the homologies shared 
among members of the Inhibin subgroup identified to date including Inhibin a, Inhibin PA and 
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Inhibin 3B. The generic sequence, shown bdow, includes both the conserved amino adds 
(standard three letter code) as wdl as aheniattve amino adds (Xaa) present at the variable 
positions within the sequence and defined by the rules set forth in Fig. 10. 



Inhibin Subgroup pattern 



Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Phe Xaa Xaa Xaa Gly Trp Xaa 
15 10 15 

Xaa Trp He Xaa Xaa Pro Xaa Xaa Xaa Xaa Xaa Xaa Tyr Cys Xaa Gly 
20 25 30 



Xaa Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
35 40 45 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
50 55 60 



Xaa Xaa Xaa Xaa Xaa Cys Cys Xaa Xaa Xaa Pro Xaa Xaa Xaa Xaa Xaa 
65 70 75 80 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Asp Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
85 90 95 

Xaa Xaa Xaa Asn Xaa Xaa Xaa Xaa Xaa Cys Xaa Cys Xaa. 
100 105 



Each Xaa can be independently selected fi'om a group of one or more spedfied amino adds 
defined as follows, wherdn: Xaa2 is Ala, Arg, Asn, Asp, Cys, Glu. Gta, Gly, His, He, Leu, Lys, 
Met, Phe, Pro, Ser, Thr, Trp, Tyr or Val; Xaa3 is Arg or Lys; Xaa4 is Ala, Arg, Asn, Asp, Cys. 
Ghj, Gin, Gly, His, He, Leu, Lys, Met, Phe, Pro, Ser, Thr, Trp, Tyr or Val; Xaa5 is Ala, Arg, Asn, 
5 Asp, Cys, Glu, Gin, Gly, His, He, Leu, Lys, Met, Phe, Pro, Ser, Thr, Trp, Tyr or Val; Xaa6 is 
Cys, He, Leu, Met, Phe, Trp, Tyr or Val; Xaa7 is Ala, Arg, Asn, Asp, Cys, Glu, Gin, Gly, His, He, 
Leu, Lys, Met, Phe, Pro, Ser, Thr, Trp, Tyr or Val; Xaa8 is He or Val; Xaa9 is Arg, Asn, Asp, 
Ghi, Glu, His, Lys, Ser or Thr, Xaa 1 1 is Arg, Gin, Glu,or Lys; Xaa 12 is Ala, Arg, Asn, Asp, Cys, 
Glu, Ghi, Gly, His, He, Leu, Lys, Met, Phe, Pro, Ser, Thr, Trp, Tyr or Val; Xaal3 is He, Leu, Met 

10 or Val; Xaal6 is Asn, Asp or Glu; Xaal7 is Arg, Asn, Asp, Gin, Glu, His, Lys, Ser or Thr; Xaa20 
is He or Val; Xaa21 is Ala, Arg, Asn, Asp, Cys, Glu, Gin, Gly, His, He, Leu. Lys, Met, Phe, Pro, 
Ser, Thr, Trp, Tyr or Val; Xaa23 is Ala, Gly, Pro, Ser, or Thr; Xaa24 is Ala, Gly, Pro, Ser, or 
Thr; Xaa25 is Phe, Trp or Tyr; Xaa26 is Ala, Arg, Asn, Asp, Cys. Glu, Gin, Gly. His, He. Leu, 
Lys, Met, Phe, Pro, Ser, Thr, Trp, Tyr or Val; Xaa27 is Ala, Arg, Asn, Asp, Cys, Glu, Gin, Gly, 

15 His, He, Leu, Lys, Met, Phe, Pro, Ser, Thr, Trp, Tyr or Val; Xaa28 is Arg, Asn, Asp, Gin, Glu, 
His, Lys, Ser or Thr; Xaa3 1 is Arg, Asn, Asp, Gin, Glu, His, Lys, Ser or Thr; Xaa33 is Ala, Arg, 
Asn, Asp, Cys, Glu, Gin, Gly, His, He, Leu, Lys, Met, Phe, Pro, Ser, Thr, Trp, Tyr or Val; Xaa35 
is Ala, Gly, Pro, Ser, or Thr; Xaa36 is Ala, Arg, Asn, Asp, Cys, Glu, Gin, Gly, His, He, Leu, Lys, 
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Met, Phc. Pro. Ser, Thr. Tip, Tyr or Val; Xaa37 is Hs, Phe, Trp or Tyr, Xaa38 is De. Leu. Met 
or Val; Xaa39 is Ala, Gly, Pro. Ser, or Thr. Xaa40 is Ala. Gly. Pro. Ser. or Thr. Xaa41 is Ala. 
Afg. Asn. Asp, Cys. Ghi. Gto, Gly, His, De, Leu. Lys, Met, Phe, Pro. Ser, Thr, Trp, Tyr or Val; 
Xaa42 is Ala. Arg. Asn, Asp, Cys, Glu, Gin, Gly, Ks, De, Leu, Lys. Met, Phe, Pro, Ser, Thr. T^p. 
Tyr or Val; Xaa43 is Ala. Gly, Pro, Ser, or Thr, Xaa44 is Ala. Arg, Asn, Asp, Cys, Ghi, Gta, Gly,' 
Ks, De. Leu. Lys. Met, Phe, Pro, Ser. Thr, Trp. Tyr or Val; Xaa45 is Ala. Gly, Pro, Sa, or Thr; * 
Xaa46 is Ala. Arg. Asn, Asp, Cys. Glu, Gto, Gly. His, De, Leu, Lys. Met, Phe, Pro,'ser,'Thr, Tii, 
Tyr or Val; Xaa47 is Ala, Gly, Pro. Ser, or Thr. Xaa48 is Ala, Arg, Asn, Asp. Cys, Glu, Gin, Gly, 
His, De, Leu, Lys, Met, Phe, Pro, Ser, Thr, Trp, Tyr or Val; Xaa49 is Ala, Arg, Asn, Asp, Cys, 
Glu, Gto. Gly, Ms, De, Leu, Lys, Met, Phe, Pro, Ser, Thr, Trp, Tyr or Val; XaaSO is Ala, Gly, 
Pro, Ser. or Thr; XaaSl is Ala, Gly. Pro, Ser, or Thr, Xaa52 is Ala, Arg, Asn, Asp, Cys, Glu, Gta, 
Gly, His, De, Leu, Lys, Met, Phe, Pro, Ser. Thr. Trp, Tyr or Val; Xaa53 is Ala, Arg, Ajai, Aii>, 
Cys, Glu, Gta, Gly, His, De, Leu, Lys, Met, Phe, Pro, Ser, Thr, Trp, Tyr or Val; Xaa54 is Ala, 
Arg, Asn, Asp, Cys. Gta. Gto, Gly, His. De. Leu, Lys, Met, Phe, Pro, Ser, Thr, Tip, Tyr or Val; 
1 5 Xaa55 is Arg. Asn, Asp, Gin, Glu, Ms, Lys. Ser or Thr, Xaa56 is Ala, Arg, Asn, Asp. Cys, Glu! 
Gto, Gly, Ms, De, Leu, Lys, Met, Phe, Pro, Ser, Thr, Trp, Tyr or Val; Xaa57 is Ala, Arg, Asn, 
Asp, Cys, Glu. Gto, Gly, Ms. De, Leu, Lys. Met. Phe. Pro, Ser, Thr, Trp, Tyr or Val; Xaa58 is 
Ala, Arg, Asn, Asp, Cys, Gto. Gto, Gly, Ms, De, Leu. Lys, Met. Phe, Pro, Ser, Thr. Trp. Tyr or 
Val; Xaa59 is Ala, Arg, Asn, Asp, Cys, Gto, Gto, Gly, His, De. Leu, Lys, Met, Phe, Pro, Ser, Thr. 
20 Trp, Tyr or Val; Xaa60 is Ala, Arg, Asn, Asp. Cys, Gto, Gto, Gly, Ms. De, Leu, Lys. Met, Phe. 
Pro, Ser, Thr, Trp, Tyr, Val or a peptide bond; Xaa61 is Ala, Arg, Asn. Asp. Cys. Glu. Gin, Gly, 
Ms, De, Leu, Lys, Met, Phe, Pro, Ser, Thr. Trp, Tyr, Val or a peptide bond; Xaa62 is Ala, Arg. 
Asn, Asp, Cys. Gto, Gto, Gly. Ms. De. Leu, Lys. Met, Phe, Pro, Ser, Thr, Trp, Tyr, Val or a 
peptide bond; Xaa63 is Ala, Arg, Asn, Asp, Cys, Gto, Gto, Gly, His, De, Leu, Lys, Met, Phe, Pro, 
25 Ser, Thr, Trp, Tyr, Val or a pepUde bond; Xaa64 is Ala, Arg, Asn, Asp, Cys, Glu, Gta, Gly, His, ' 
He, Leu, Lys, Met, Phe, Pro, Ser, Thr, Trp, Tyr or Val; Xaa65 is Ala, Gly, Pro, Ser, or Thr, 
Xaa66 is Ala, Arg, Asn, Asp, Cys, Glu, Gin, Gly, Ms, De, Leu, Lys, Met, Phe, Pro, Ser, Th^. Trp. 
Tyr or Val. Xaa67 is Ala, Arg, Asn, Asp, Cys, Glu, Gto. Gly, Ms, He, Leu, Lys, Met, Phe, Pro, 
Ser, Thr, Trp. Tyr or Val; Xaa68 is Arg, Asn, Asp, Gin, Gto, Ms. Lys, Ser or Thr; Xaa69 is Ala. 
30 Gly, Pro, Ser, or Thr; Xaa72 is Ala, Arg, Asn. Asp, Cys, Glu, Gin. Gly, Ms, He, Leu, Lys, Met, 
Phe, Pro. Ser. Thr. Trp. Tyr or Val; Xaa73 is Ala. Arg. Asn. Asp, Cys, Glu, Gin, Gly, Ms. He. 
Leu. Lys, Met, Phe, Pro. Ser. Thr. Trp, Tyr. Val or a peptide bond; Xaa74 is Ala, Arg. Asn. Asp. 
Cys, Gto, Gto. Gly, Ms. De. Leu. Lys, Met. Phe, Pro, Ser, Tto, Trp, Tyr, Val or a peptide bond; 
Xaa76 is Ala, Gly, Pro, Ser. or Thr. Xaa77 is Arg. Asn. Asp, Gto, Gto. Ms, Lys. Ser or Thr; 
35 Xaa78 is Leu or Met; Xaa79 is Arg, Asn, Asp, Gin, Gto, Ms, Lys, Ser or Thr; Xaa80 is Ala, Gly, 
Pro, Ser, or Thr; Xaa81 is Leu or Met; Xaa82 is Arg, Asn, Asp, Gin, Glu, Ms, Lys, Ser or Thr; 
Xaa83 is He. Leu, Met or Val; Xaa84 is Ala, Arg, Asn, Asp, Cys, Glu. Gto. Gly, Ms, He, Leu, 
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Lys. Met, Phe, Pro, Ser, Thr, Trp, Tyr or Val; Xaa85 is Ala, Arg, Asn, Asp, Cys, Glu, Gin, Gly, 
His, Dc, Leu, Lys, Met, Phe, Pro, Ser, Thr, Trp, Tyr or Val; Xaa86 is Ala, Arg, Asn, Asp, Cys, 
(Hu, Gin, Giy, His, He, Leu, Lys, Met, Phe, Pro, S», Thr, Tip, Tyr or Val; Xaa87 is Aig, Asn, 
Asp, Gin, Ghi, His, Lys, Set or Thr, Xaa89 is Ala, Arg, Asn, Asp, Cys, Glu, Gin, Gly, His, He. 
Leu, Lys, Met, Phe, Pro, Ser, Thr, Trp. Tyr or Val; Xaa90 is Ala, Arg, Asn, Asp, Cys, Glu, Gto, 
Gly, His, De, Leu, Lys, Met, Phe, Pro, Ser, Thr, Trp, Tyr, Val or a peptide bond; Xaa91 is Ala, 
Arg, Asn, Asp, Cys, Glu, Ghi, Gly, His, fle. Leu, Lys, Met, Phe, Pro, Ser, Thr, Trp, Tyr or Val; 
Xaa92 is Arg, Asn, Asp, Ghi, Glu, His, Lys, Ser or Thr, Xaa93 is Cys, De, Leu, Met, Phe, Trp, 
Tyr or Val; Xaa94 is Ala, Arg, Asn, Asp, Cys, Glu, Gta, Gly, His, Ue, Leu, Lys, Met, Phe, Pro. 
Ser, Thr, Trp, Tyr or Val; Xaa95 is Ala, Arg, Asn, Asp, Cys. Glu, Gin, Gly, His, De, Leu, Lys, 
Met, Phe, Pro, Ser, Thr, Trp, Tyr or Val; Xaa96 is Arg, Gto, Glu,or Lys; Xaa97 is Arg, Asn, Asp. 
Gto, Gto, IBs, Lys, Ser or Thr; Xaa98 is He or Val; Xaa99 is Ala, Arg, Asn, A^, Cys, Glu, CHn, 
Gly, Ms, De. Leu, Lys, Met, Phe, Pro, Ser, Thr, Trp, Tyr or Val; XaalOl is Leu or Met; Xaal02 
is De, Leu, Met or Val; Xaa 1 03 is Ala, Arg, Asn, Asp, Cys, Glu, Gto, Gly, His, De, Leu, Lys, Met, 
Phe, Pro, Ser, Thr, Trp, Tyr or Val; Xaal04 is Gin or Glu; XaalOS is Arg, Asn, Asp, Gto, Gto, 
His, Lys, Ser or Thr; Xaal07 is Ala or Gly; Xaal09 is Ala, Arg, Asn, Asp, Cys, Qu, Gto, Gly, 
Ks, De, Leu, Lys, Met, Phe. Pro, Ser, Thr, Trp, Tyr or Val. 

n. Polypeptide Linker Considerations. 

In the morphon constructs, polypeptide linker sequences join and maintain the spatial 
relationship of the finger and heel regions thereby helping to maintain the cysteine structural 
motif, i.e., the cysteine pattern and knot structure, which characterizes and is believed to be 
important in matotaining the tertiary structure of each subunit of a TGF-p superfamily member. 

It is contemplated that the polypeptide linkers preferably are about 2-20, and most preferably 
about 3-13 amino acids in length. The polypeptide linkers should be long enough to permit the 
finger and heel regions to assume their proper conformation as defined as being able to bind a 
receptor that typically binds a TGF-p superfamily member. The specific amino acid composition 
of the polypeptide linker may vary, however, cysteine residues preferably should be avoided. 
Cysteine residues in the linker sequence may disulfide bond with the cysteine residues that define 
the knot structure thereby affecting the cysteine structural motif It is contemplated that the 
amino acid sequence of each linker may be based either in whole or in part upon: I) the amino 
acid sequences normally present at the ends of the template finger and heel regions; 2) unrelated 
amino acid sequences, predicted by computer based algorithms to be sufficient to connect the 
ends of the finger and heel regions without affecting the spatial arrangement of the finger and heel 
regions in the morphon construct, 3) multiples or portions of pentomeric repeat units of Gly4Ser 
or Ser4Gly; or 4) a combination thereof 
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Accordingly, the artisan simply may choose an amino add sequence, sudi as a P*tum, based 
upon the amino add sequence that normally would connect the finger and hed regions within a 
teiiq>late molecule. Alternatively, once the appropriate bed and finger regions have been chosen 
in accordance with the prindples desoibed above, the artisan may use a variety of commercially 
5 available computer-based algorithms to predict appropriate link^ sequences capable of linking the 
finger and bed regions. Currently preferred, publicly available computer programs include, but 
are not limited to, CONGEN (Bruccoleri et di. (1987) Biopolvmers 26 : 137) and TWEAK 
(Shenkin et al. (1987) Biopolvmers 26-2053-2085), the disclosures of which are incorporated 
herein by reference. Currently preferred, commerdally available software packages include, but 

10 are not limited to, INSIGHTEL, HOMOLOGY, and DISCOVER, all of which are available from 
Biosym, Inc. Other computer-based programs useful in produdng appropriate linker sequences 
are disclosed in U.S. Patent Nos. 4,704,692; 4,881,175; 4,939,666; 4,946,778; and 5,260^03. 
the disdosures of which are incorporated herdn by reference. The aforemmtioned programs and 
software packages mable the artisan to identify pre-existing amino add sequoices in protein 

15 structure databases or build using molecular simulation and/or thermodynamic calculations amino 
add sequences for linking two polypeptide chains through designated N- and C*termini. In 
addition, as mentioned above, the artisan may simply link the ends of the appropriate finger and 
heel regions with an amino acid sequence based in whole, or in part, on the pentomeric repeat 
units of Gly4Ser or Ser4Gly. Such repeats have been used successfully to construct single-chain 

20 antibody binding sites. Methodologies for designing, making, testing and using linkers based on 
the pentomeric repeats of Gly4Ser or Ser4Gly are disclosed in U.S. Patent. Nos. 5,132,405; 

5,091,513; and 5,258,498 and PCT Application No. US88/01737, the disdosures of wWch are 
incorporated herdn by reference. 

It is contemplated that synthetic morphons may be joined through their N- or C- termini via a 
25 polypeptide linker to a second ftmctional protein domain to produce a multifunctional protein 
construct. For example, a morphon construct may be linked by means of an additional 
polypeptide linker to a single-chain antibody binding site for targeting, or to a chemokine, 
hormone, toxin, or other physiologically active functional protein. For details of the construction 
of these multifunctional proteins using such linkers, see U.S. Patent Nos. 5,132,405 and 
30 5,091,513. 

III. Design and Production of Morphons. 

As mentioned above, the biosynthetic constructs of the invention preferably are 
manufactured by using conventional recombinant DNA methodologies well known and 
thoroughly documented in the art. 
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For example, the constructs may be manu&ctured by the assembly of synthetic nudeotide 
sequences and/or joining DNA restriction fragments to produce a synthetic DNA molecule. The 
DNA molecules then are ligated into an expression vehide, for example an e3q)ression plasmid» 
and transfected into an appropriate host cdl, for example E. coli . The protein encoded by the 
S DNA molecule then is e}q)ressed, purified, folded if necessary, tested in vitro for binding activity 
vnAk a receptor having binding afiBnity for the template TGF-P super&nily memba*, and 
subsequently tested to assess whether the biosynthetic construct mimics the activity of the 
template superfemily member. 

Altemativdy, a library of synthetic DNA constructs can be prepared simultaneously for 

10 exanq>le^ by the assembly of synthetic nudeotide sequences that differ in nudeotide conq>osttion 
in a presdected region. For example, it is contmiplated that during production of a morphon 
construct based upon a specific TGF-P superfamily member, the artisan may choose appropriate 
finger and heel regions for such a super&mily member (for example firom Figs. 5-9). Once the 
appropriate finger and heel regions have been selected, the artisan then may produce synthetic 

1 S DNA encoding these regions, which subsequently may be cormected by linker sequences encoding 
polypeptide linkers. For example, if a plurality of DNA molecules encoding different linker 
sequences are induded into a ligation reaction containing DNA molecules encoding finger and 
hed sequences, by judicious choice of appropriate restriction sites and reaction conditions, the 
artisan may produce a library of DNA constructs wherein each of the DNA constructs encode 

20 finger and heel regions but connected by (tijOTeient linker sequences. The resulting DNAs then are 
ligated into a suitable expression vehicle, i.e., a plasmid usefiil in the preparation of a phage 
display library, transfected into a host cell, and the polypeptides encoded by the synthetic DNAs 
expressed to generate a pool of candidate compounds. The pool of candidate compounds 
subsequently can be screened to identify lead compounds having binding affinity and/or selectivity 

25 for a pre-selected receptor. 

Screening may be performed by passing a solution comprising the candidate compounds 
through a chromatography column containing surface immobilized receptor. Then lead 
compounds with the desired binding specificity are eluted, for example by means of a salt gradient 
and/or a concentration gradient of the template TGF-P superfamily member. Nucleotide 
30 sequences encoding the lead compounds subsequently may be isolated and characterized. Once 
the appropriate nucleotide sequences of the lead compounds have been identified, the lead 
compounds subsequently may be produced, either by conventional recombinant DNA or peptide 
synthesis methodologies, in quantities sufficient to test whether the particular construct mimics 
the activity of the template TGF-p superfamily member. 
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It is contemplated that, which ever approach is adopted to produce DNA molecules 
encoding morpheas of the invention, the tertiary structure of the lead compound or compounds 
may subsequently be modulated in order to optimize binding and/or biolo^cal activity by, for 
example, using a combination of nucleotide mutagenesis methodologies aided by the principles 
5 described hercdn and phage display methodologies. Accordingly, an artisan may produce and test 
simultaneously iaige numbers of candidate compounds. The refined compounds produced by this 
approach subsequently may be screened for either agonist or antagonist activity by using 
conventional in vitro and/or in vivo assays rimilar to the assays discussed hereinbelow. 

A. Gene Svnthesis. 

10 The processes for manipulating, amplifying, and recombining DNA which encode amino acid 

sequences of interest generally are well known in the art, and therefore, are not described in detail 
herein. Methods of identifying and isolating genes encoding members of the TGF-P superfamily 
and their cognate receptors also are well understood, and are described in the patent and other 
literature. 

15 Briefly, the construction of DNAs encoding the biosyntheuc constructs cfisclosed herdn is 

perfonned using known techniques involving the use of various restriction enzymes which make 
sequence specific cuts in DNA to produce blunt ends or cohesive ends, DNA ligases, techniques 
enabling enzymatic addition of sticky ends to blunt-ended DNA, construction of synthetic DNAs 
by assembly of short or medium length oligonucleotides, cDNA synthesis techniques, polymerase 

20 chain reaction (PCR) techniques for amplifying appropriate nucleic acid sequences from libraries, 
and synthetic probes for isolating genes of members of the TGF-b superfamily and their cognate 
receptors. Various promoter sequences firom bacteria, mammals, or insects to name a few, and 
other regulatory DNA sequences used in achieving expression, and various types of host cells are 
also known and available. Conventional transfection techniques, and equally conventional 

25 techniques for cloning and subcloning DNA are useful in the practice of this invention and known 
to those skilled in the art. Various types of vectors may be used such as plasmids and viruses 
including animal viruses and bacteriophages. The vectors may exploit various marker genes 
which impart to a successfully transfected cell a detectable phenotypic property that can be used 
to identify which of a family of clones has successfully incorporated the recombinant DNA of the 

30 vector. 

One method for obtaining DNA encoding the biosynthetic constructs disclosed herein is by 
assembly of synthetic oligonucleotides produced in a conventional, automated, oligonucleotide 
synthesizer followed by ligation with appropriate ligases. For example, overlapping, 
complementary DNA fragments may be synthesized using phosphoramidite chemistry, with end 
35 segments left unphosphorylated to prevent polymerization during ligation. One end of the 
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synthetic DNA is left with a "sticky end** corresponding to the site of action of a particular 
restriction endonudease, and the other end is left with an end corresponding to the site of action 
of another restriction endomidease. The con^Iimmtary DNA fragments are ligated togethra* to 
produce a synthetic DNA construct. 

5 Altonatively nudeic add strands encoding finger 1, finger 2 and heel regions may be isolated 

from libraries of nuddc adds, for example, by colony hybridization procedures such as those 
described in Sambrook et sd. eds. (1 989) " Molecular Cloning ". Coldspring Harbor Laboratories 
Press, NY, and/or by PCR amplification methodologies, such as those disclosed in Innis et al. 
(1990) " PCR Protocols. A guide to methods and applications ", Academic Press The nuddc 
1 0 adds encoding the finger and heel regions then are jomed together to produce a synthetic DNA 
encoding the biosynthetic single-chain morphon construct of interest. 

It is appredated, however, that a Ubraiy of DNA constructs encoding a plurality of 
morphons nu^ be produced simultaneously by standard recombinant DNA methodologies, such 
as the ones, described above. For example, the skilled artisan by the use of cassette mutagenesis 

IS or oligonucleotide directed mutagenesis may produce, for example, a series of DNA constructs 
each of which contain different DNA sequences withm a predefined location, e.g., within a DNA 
cassette encoding a linker sequence. The resulting library of DNA constructs subsequently may 
be expressed, for example, in a phage display library (see section V below) and any protein 
constructs that binds to a spedfic receptor may be isolated by aflBnity purification, e.g., using a 

20 chromatographic column comprising surface immobilized receptor (see seaion V below). Once 
molecules that bind the preselected receptor have been isolated, their binding and agonist 
properties may be modulated using the empirical refinement techniques also discussed in section 
V, bdow. 

B. Protdn Expression 

25 If a single DNA construct of interest has been synthesized, it may be integrated into an 

expression vector and transfected into an appropriate host cell for protein expression. Usefiil 
prokaryotic host cells include, but are not limited to, E. coH, and B. Subtilis . Usefiil eukaryotic 
host cells include, but are not limited to, yeast cells, insect cells, myeloma cells, fibroblast 3T3 
cells, monkey kidney or COS cells, Chinese hamster ovary (CHO) cells, mink-lung epithelial cells, 

30 human foreskin fibroblast cells, human glioblastoma cells, and teratocarcinoma cells. 

Alternatively, the synthetic genes may be expressed in a cell-fi-ee system such as the rabbit 
reticulocyte lysate system. 

The vector additionally may include various sequences to promote correct expression of the 
recombinant protein, including transcriptional promoter and termination sequences, enhancer 
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sequences, preferred ribosome binding site sequences, prefmed mRNA leader sequ^ices, 
preferred protein processing sequences, prefixed signal sequences for protein secretion, and the 
like. The DN A sequence encoding the gene of interest also may be manipulated to remove 
potttitially inhibiting sequraces or to minimize unwanted secondary structure formation. The 
5 morphon proteins also may be expressed as fusion proteins. After being translated, the protdn 
may be purified fix>m the cdls themselves or recovered fix>m the culture medhmi and then cleaved 
at a spedfic protease site if so desired. 

For example, if the gene is to be expressed in E. coli. it is doned into an appropriate 
e9q>ression vector. This can be accomplished by positioning the agineered gene downstream of a 

1 0 promoter sequence such as Tip or Tac, and/or a gene coding for a leader peptide such as 
fi:agment B of protdn A (FB). During expression, the resuhing fusion proteins accumulate in 
refractile bodies in the qrtoplasm of the cells, and may be harvested afto" disruption of the cells by 
French press or sonication. The isolated refracdle bodies then are solubilized, and the expressed 
protdns folded and the leader sequence cleaved, if necessary, by methods akeady established with 

1 5 many other recombinant proteins. 

Expression of the engineered genes in eukaryotic cells requires cdls and cell lines that are 
easy to transfect, are capable of stably maintaining foreign DNA with an unrearranged sequence, 
and which have the necessary cellular components for efiBdent transcription, transition, post- 
translation modification, and secretion of the protein. In addition, a suitable vector carrying the 

20 gene of interest also is necessary. DNA vector design for transfection into mammalian cells 

should include appropriate sequences to promote expression of the gene of interest as described 
herein, including appropriate transcription initiation, termination, and enhancer sequences, as well 
as sequences that enhance translation efficiency, such as the Kozak consensus sequence. 
Preferred DNA vectors also include a marker gene and means for amplifying the copy number of 

25 the gene of interest. A detailed review of the state of the art of the production of foreign proteins 
in manmialian cells, including useful cells, protein expression-promoting sequences, marker genes, 
and gene amplification methods, is disclosed in Bendig (1988) Genetic Engineering 7:91-127. 

The best characterized transcription promoters useful for expressing a foreign gene in a 
particular mammalian cell are the SV40 eariy promoter, the adenovirus promoter (AdMLP), the 
30 mouse metallothionein-I promoter (mMT-I), the Rous sarcoma virus (RSV) long terminal repeat 
(LTR), the mouse mammary tumor virus long terminal repeat (MMTV-LTR), and the human 
cytomegalovirus major intermediate-early promoter (hCMV). The DNA sequences for all of 
these promoters are known in the art and are available commercially. 

The use of a selectable DHFR gene in a dhfr cell line is a well characterized method useful 
35 in the amplification of genes in mammalian cell systems. Briefly, the DHFR gene is provided on 
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the vector cairying the gene of interest, and addition of increasing concentrations of the cytotoxic 
drug methotrexate, whidi is metabolized by DHFR, leads to amplification of the DHFR gene copy 
numb^, as wdl as that of the assodated gene of interest. DHFR as a selectable, amplifiable 
marker gene in transfected diinese hamster ovary cell lines (CHO cdls) is particidarly well 
5 characterized in the art. Other useful amplifiable maricer genes indude the adenosine deaminase 
(ADA) and ghitamine synthetase (GS) genes. 

The choice of cells/cdl lines is also important and depends on the needs of the experimenter. 
COS cells provide hig^ levds of transient gene e?q>ression, providing a useful means for rapidly 
screening the biosynthetic constructs of the invention. COS cdls typically are transfected with a 
10 simian virus 40 (SV40) vector carrying the g»e of interest. The transfected COS cdls eventually 
die, thus preventing the long term production of the desired protein product. However, transient 
expression does not require the time consuming process required fi)r the development of a stable 
cell line, and thus provides a useful technique for testing preliminary constructs for binding 
activity. 

15 The various cdls, cell lines and DNA sequences that can be used for mammali^^n cell 

e)q>ression of the single-chain constructs of the invention are well characterized in the art and are 
readily available. Other promoters, selectable markers, gene amplification methods and cdls also 
may be used to express the proteins of this invention. Particular details of the transfection, 
expression, and purification of recombinant proteins are well documented in the art and are 

20 understood by those having ordinary skill in the art. Further details on the various technical 
aspects of each of the steps used in recombinant production of foreign genes in mammalian cell 
expression systems can be found in a number of texts and laboratory manuals in the art, such as, 
for example, F.M. Ausubel etal., ed., Current Protocols in Molecular Biology. John Wiley & 
Sons, New York, (1989). 

25 IV. Screening Binding and Biological Activity. 

Depending upon the cell type used, the DNA encoding the morphon constructs may be 
expressed to produce morphons having either folded or unfolded conformations. For example, it 
is appreciated that the morphon may spontaneously fold into an active conformation during 
production in a host cell, such as a myeloma cell. However, during expression in prokaryotic 
30 organisms, such as E. coll . the protein may form unfolded protein aggregates called inclusion 
bodies. The inclusion bodies, once harvested are subjected in vitro to standard denaturing and 
renaturing conditions to produce a biologically aaive protein conformation. 



Irrespective of which protein expression, harvesting, and, if necessary folding, methodologies 
are used, the resulting proteins that bind preferentially to a preselected receptor subsequently may 
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be idratified using standard methodologies, i.e., ligand/receptor binding assays, well known, and 
thoroughly documented in the art. See for example: Legerski et al. (1992) Biochem. Biophvs. 
Res rnmrn IM- 672-679; Frakar ct al. (1978) Biochem. Biophvs. Res. Comm gQ !849>8S7; 
Chip al. (1990) Nature 343: 266-269: Dahhnan et al. (1988) Biochem 27: 1813-1817; Strader 
5 cl Id. (1989) L BioL Chem. 264: 13572-13578; and DDowd al. (1988) J. BioL Ghent 263 : 
15985-15992. 

Typically, in a ligand/receptor binding assay, the template TGF-p superfamily member of 
interest having a known, quantifiable afiBnity for a preselected receptor is labeled with a detectable 
moiety, for example, a radiolabel, a chromogenic label, or a fluorogenic label. Aliquots of 

10 purified receptor, receptor binding domain Augments, or cells expressing the receptor of interest 
on thdr surface are incubated with the labeled TGF-P superfamily member in the presence of 
various concentrations of the unlabeled morphon. The relative binding a£Snity of a candidate 
morphon may be measured by quantitating the ability of the candidate (unlabded morphon) to 
inhibit the binding of the labeled tonplate TGF-P superfiunily member with the receptor. In 

15 performing the assay, fixed concentrations of the receptor and the TGF-P superfamily member are 
incubated in the presence and absence of unlabeled morphon. Sensitivity may be increased by 
pre-incubating the receptor with the candidate morphon before adding the labeled template TGF- 
P superfamily member. After the labeled competitor has been added, sufficient time is allowed for 
adequate competitor binding, and then free and bound labeled superfamily members are separated 

20 from one another, and one or the other measured. 

Useful labels useful in the practice of the screening procedures include radioactive labels i.e 
125i, 131i, minor 77Br, chromogenic labels, spectroscopic labels such as those disclosed in 
Haughland (1994) " Handbook of Fluorescent and Research Chemicals 5 ed. " by Molecular 
Probes, Inc., Eugene, OR, or conjugated enzymes having high turnover rates, i.e., horseradish 
25 p^oxidase, alkaline phosphatase, or P-galactosidase, used in combination with chemiluminescent 
or fhiorogenic substrates. 

The biological activity, namely the agonist or antagonist properties of the resulting 
constructs may subsequently be characterized using conventional in vivo and in vitro assays that 
have been developed to measure the biological activity of any TGF-P superfamily member. It is 
30 appreciated, however, the type of assay used will depend on the TGF-P superfamily member upon 
which the morphon is based. For example, morphon constructs based upon natural OP-1 protein 
may be assayed using any of the biological assays that have been developed to date for measuring 
OP-1 activity, see, for example, the exemplification's set forth below. 
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V , Rnhaiicement of Binding Activitv. 

Once lead compounds have been identified i.e., those having at least basal binding afiBnity for 
a receptor that normally binds the template TGF-p superfamily member, their affinities may be 
enhanced by a combination of computer-based protein sUiiaure modeling and rational drug 
S design methodologies. For example, after computer modeling of the candidate morphon 

construct, solvent es^posed surfaces of the finger and heel regions or altemativdy linker sequences 
may be mutated to enhance or modulate binding activity and/or specificity. 

During such an approach, it is appreciated that several structural elements may independently 
be mutated in separate constructs. After isolating morphons having the highest binding and/or 

10 biological activity, each of the elements giving rise to ^ihanced activity may be isolated, 
characterized and then combined in a single construct. This type of approach has been used 
successfiilly in the production of a human growth hormone analog that has an affinity for hormone 
receptor 400-fold greater than the wild type human growth honnone (Wells et al.. in " Advances in 
Gene Tedmology: Protean Engineering and Beypntf" plOS). Ahematively, a phirality of regions 

1 S within the protdn may be modified sinmltaneousiy in a single e?q>«iment and constructs having 
the highest binding affinity and/or spedfidty isolated. It is iqjpreciated, however, that several 
rounds of mutagenesis and screening may be performed until a construct having the desired 
binding affinity is found. 

An important aspect of the refinement process is that, when recombinant DNA 
20 methodologies are used it is necessary to establish and maintain a physical connection between the 
polypeptide of interest and the nucleic acid encoding it. A system useful in the practice of this 
aspect of the invention includes, for example, phage display methodologies wherein the peptide of 
interest is inmiobilized on the surface of the phage as a fusion with a phage coat protein, see for 
example, Scott et al. ( 1990) Science 249: 386-390; Parmely etal., (1988) Gene 73 : 305-3 1 8; 
25 Devlin et al.. (1990) Science 249 : 404-406; McCafferty et aL (1990) Nature 348 : 552-554; Marks 
et al. (1992) J. Biol. Chem. 267 : 16007-16010: Lowman et al. (1991) Biochem. 30 : 10832- 
10838; Wells et ah supra : Amberg et al in Strategies in Molecular Biology 6:2-4, and the 
Recombinant Phage Antibody System instruction manual fi-om Pharmacia, P-L Biochemicals. 

Briefly, the DNAs encoding candidate morphon constructs are expressed on the surface of 
30 phage particles, typically as N-terminal fusions to a phage minor coat protein. In a typical 
refinement experiment, it is contemplated that nucleotides encoding specific portions of the 
synthetic gene, e.g., those encoding linker sequences, may be randomized so as to produce a 
library of nucleic acid sequences encoding constructs that have random amino acid sequences in 
that region (See Section 3A). Once a library of DNA molecules encoding morphons has been 
35 produced, the DNAs are ligated into the phage genome at a position adjacent to the gene 
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encoding a minor coat protein. The resulting DNAs are padcaged, amplified in a host cell, and 
expressed to produce recombinant phage partides having on their ceU sur&ce minor coat proteins 
fiised to the proton of interest. The recombinant phage particles that express morphons 
constructs having the desired binding aflEmity and/or spedficity for a receptor of interest 
5 subsequendy are isolated by several rounds of a£Biiityenridmient followed b^ By 
using these approaches, single phage particles expressing polypq>tides having spedfic biological 
properties may be isolated routindy fi^om phage libraries comprising more than 10^-10^ phage 
partides. 

AfiBnity enrichment may, for example, be performed by pasang a solution comprising 

1 0 recombinant phage partides through a chromatography cohmm that comprises sur&ce 

immobilized receptor under conditions which permit morphons expressed on the sur&ce of 
recombinant phage partides to bind to the receptor. Then, the column is washed to remove 
residual and/or non-specifically bound phage. Phage expressing the confonnationally active 
morphons are duted by specific desorption, for example, by addition of excess of the template 

15 TGF-P super£amily member, or by non-specific desorption, for ocample, using pH, polarity- 
redudng agents or chaotropic salts. The highest binding particles may be ehited using 
concentration gradient of the desorption indudng reagent wherdn the highest binding particles 
dute at higher concentrations of the reagent. The highest binding particles, once duted are re- 
amplified, and the nucldc acid encoding the morphon of interest isolated and sequenced. Once 

20 the DNA sequence, and therefore the nucldc acid sequence of the highest binding morphons, has 
been determined, then the refinement process may be repeated, for example, by mutagenesis of 
another portion of the molecule, until morphon molecules having a preferred binding activity have 
been produced. The resulting constructs subsequently may be assayed for biological activity using 
in vivo or in vitro assays that have been developed for each of the template TGF-P superfamily 

25 members identified to date. 

Following the identification of usefiil morphon constructs, the morphons may be produced in 
commerdally usefiil quantities, for example, by producing cell lines that express the morphon of 
interest or by producing synthetic peptides defining the appropriate amino acid sequence. It is 
appreciated, however, that conventional methodologies for producing the appropriate cell lines 
30 and for producing synthetic peptides are well known and thoroughly documented in the art, and 
so are not discussed in detail herein. 

VI. Formulation and Bioactivitv 



35 



The resulting morphon constructs may be provided to an individual as part of a therapy to 
enhance, inhibit, or otherwise modulate the in vivo binding interaction between a TGF-P 
superfamily member and one or more of its cognate receptors. The constructs may be formulated 
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in a pharmaceutical compositioii, as described below, and may be administered by any suitable 
means, preferably directly or systematically, e.g., parentally or orally. 

Whm the morphon construct is to be provided directly (e.g., locally, as by injection, to a 
desired tissue site), or parratally, such as by intravenous, subcutaneous, intramuscular, 
S intraorbital, ophthalmic, intraventricular, intracranial, intracapsular, intraspinal, intracistemal, 
intraperitoneal, buccal, rectal, vaginal, intranasal or by aerosol administration, the therapeutic 
composition preferably comprises part of an aqueous sohition. The solution preferably is 
physiologically acceptable so that in addition to delivery of the desired construct to the patient, 
the solution does not otherwise adversely a£Pect the patient's electrolyte and volume balance. The 
10 aqueous medium for the ihmpcaUc molecule thus may comprise, for example, normal 

physiological saline (0.9% NaCl, O.ISM), pH 7-7.4 or other phannaceutically acceptable salts 
thmof 

Useful solutions for oral or parenteral administration may be prepared by any of the methods 
well known in the pharmaceutical art, described, for example, in "Remington's Phannaceutical 

15 Sciences'*. (Gennaro, A., ed.). Mack Pub., 1990. Formulations may include, for example, 
poiyalkylene glycols such as polyethylene glycol, oils of vegetable origin, hydrogenated 
naphthalenes, and the like. Formulations for direct administration, in particular, may include 
glyco-ol and other compositions of high viscosity. Biocompatible, preferably bioresorbable 
polymers, including, for example, hyaluronic acid, collagen, tricalcium phosphate, polybutyrate, 

20 polylactide, polyglycolide and lactide/glycolide copolymers, may be useful excipients to control 
the release of the morphogen in vivo . 

Other potentially useful parenteral delivery systems for these therapeutic molecules include 
ethylene-vinyl acetate copolymer particles, osmotic pumps, implantable infusion systems, and 
liposomes. Formulations for inhalation administration may contain as excipients, for example, 
25 lactose, or may be aqueous solutions containing, for example, polyoxyethyIene-9-lauryl ether, 
glycocholate and deoxycholate, or oily solutions for administration in the form of nasal drops, or 
as a gel to be applied intransally. 

Finally, therapeutic molecules may be administered alone or in combination with other 
molecules known to effect tissue morphogenesis, i.e., molecules capable of tissue repair and 
30 regeneration and/or inhibiting inflammation. Examples of useful cofactors for stimulating bone 
tissue growth in osteoporotic individuals, for example, include but are not limited to, vitamin D3, 

calcitonin, prostaglandins, parathyroid hormone, dexamethasone, estrogen and IGF-I or IGF-II. 
Useful cofactors for nerve tissue repair and regeneration may include nerve growth factors. Other 
useful cofactors include symptom-alleviating cofactors, including antiseptics, antibiotics, antiviral 
35 and antifungal agents and analgesics and anesthetics. 
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Therapeutic molecules further can be formulated into pharmaceutical compositions by 
admixture with pharmaceutically acceptable nontoxic exdpients and carriers. As noted above, 
such compositions may be prepared for parenteral administration, particularly in the form of liquid 
solutions or suq;)ensions; for oral administration, particularly in the form of tablets or capsules; or 
5 ifitranasally, particularly in the form of powders, nasal drops or ao-osols. Where adhesion to a 
tissue surface is deared the compo^on may indude the biosynthetic construct dispersed in a 
fibrinogen-thrombin composition or other bioadhe^ve such as is disdosed, for example in PCT 
US91/0927S, the disclosure of which is incorporated herein by reference. The composition then 
may be painted, sprayed or otherwise applied to the desired tissue sur&ce. 

10 The compositions can be formulated for parenteral or oral administration to humans or other 

mammals in therapeutically effective amounts, e.g., amounts which provide appropriate 
concentrations of the morphon to target tissue for a time sufSdent to induce the desired effect. 

Where the therapeutic molecule comprises part of a tissue or organ preservation solution, 
any commerdally available preservation sohition may be used to advantage. For example, useful 
1 S solutions known in the art include Collins sohition, Wisconsin solution, Bdzer solution, 
EurocoUins solution and lactated Ringa^s solution. A detailed description of preservation 
sohitions and useful componmts may be found, for example, in U.S. Patent No. 5,002,965, the 
disclosure of which is incorporated herein by reference. 

It is contemplated also that some of the morphon constructs, for example morphons based 
20 upon members of the Vg/dpp subgroup, may exhibit the highest levels of activity in vivo when 
combined with carrier matrices i.e., insoluble polymer matrices. See for example, U.S. Patent No. 
5,266,683 the disclosure of which is incorporated by reference herein. The currently preferred 
carrier matrices are xenogenic, aUogenic or autogenic in nature. It is contemplated, however, that 
synthetic materials comprising polylactic add, polyglycoUc acid, polybutyric acid, derivatives and 
25 copolymers thereof may also be used to generate suitable carrier matrices. Preferred synthetic 
and naturally derived matrix materials, their preparation, methods for formulating them with the 
morphogenic proteins of the invention, and methods of administration are well known in the art 
and so are not discussed in detailed herein. See for example, U.S. Patent No. 5,266,683. 

As will be appreciated by those skilled in the art, the concentration of the compounds 
30 described in a therapeutic composition will vary depending upon a number of factors, including 
the dosage of the drug to be administered, the chemical characteristics (e.g., hydrophobicity) of 
the compounds employed, and the route of administration. The preferred dosage of drug to be 
administered also is likely to depend on such variables as the type and extent of a disease, tissue 
loss or defect, the overall health status of the particular patient, the relative biological efficacy of 
35 the compound selected, the formulation of the compound, the presence and types of excipients in 
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the fonmilatioo, and the route of administration. In general terms, the therapeutic molecules of 
this invention may be provided to and individual where typical doses range fiom about 10 ng/kg 
to about 1 g/kgofbody wdght per day; with a preferred dose range bdng from about 0.1 mg/kg 
to 100 mg/kg of body wdght. 

5 Practice of the invention will be more fiiUy understood from the following eTcamples which 

are for ilhistrative purposes only and, therefore, should not be construed as limiting the invention 
in any way. 

Example I. Design and Production of Morphon Based Upon the TGF-p Superfamily 
Member OP-L 

1 0 An exraiplary OP-I based morphon designed in accordance with the principles described 

herein and having a DNA sequence defined in the Sequence Usting as SEQ. ID. NO.: 29 and 
which mcodes a protein sequence defined in the Sequence Listing as SEQ. ID. NO. 30 (and as 
shown m Figure 1 1) may be produced using standard recombinant DNA methodologies, such as 
those described hereinabove. The DNA sequence encoding the exemplary OP-1 morphon, SEQ. 

15 ID. NO.: 29, comprises a plurality of restriction sites which, should the need arise, ^plifies 

subsequent steps, i.e., cassette mutagenesis, in the empirical refinement of the morphon structure. 

The OP-1 morphon protein sequence set forth in SEQ. ID. NO. 30 comprises from the N- 
teminus to the C-terminus, an OP-1 finger 1 sequ^ice (amino add residues 1-32 in SEQ. ID. NO. 
30); a large linker sequence (amino acid residues 33-44 in SEQ. ID. NO. 30); an OP-1 heel 
20 sequence (amino acid residues 45-75 in SEQ. ID. NO. 30); a small linker sequence (amino acid 
residues 76-79 in SEQ. ID. NO. 30) and an OP-1 finger 2 sequence (amino acid residues 80-1 15 
in SEQ. ID. NO. 30). 

The DNA sequence encoding the OP-1 morphon (SEQ. ID. NO. 30) is defined in SEQ. ID. 
NO. 30 and may be produced by standard recombinant DNA methodologies, for example, 
25 synthesis of synthetic DNA sequences followed by their ligation into an appropriate expression 
vector. It is contemplated, however, that in order to enhance the likelihood of producing an OP-1 
morphon construct having binding and/or agonist properties, the following method may be used. 

Briefly, this method comprises the steps of inserting into a cDNA clone encoding human OP- 
1, an N-terminal linker, a C-terminal linker, a large loop connecting the finger I and heel regions, 
30 and a small loop connecting the heel and finger 2 regions. The modified cDNA subsequently is 
excised from its vector and ligated into a phage display vector, e.g. pCANTAB 5 from Pharmacia, 
and then, in a series of independent mutagenesis steps, in combination with screening in the phage 
display system, the hnker regions are mutated by oligonucleotide directed mutagenesis to produce 
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a pool of candidate morphon constructs. Phage which express on their virion surface morphon 
constnicts that bind the OP* I receptor are isolated and the geae oicoding the morphon constructs 
sequenced. A final constnict comprising finger and heel regions as well as linker regions 
identified firom the highest binding individual constructs then are combined to produce an 
5 optimized OP-l morphon construct. A C-tenninal tail is introduced into the optimized moq>hon 
constnict by oligonucleotide directed mutagenesis after which the resulting morphon construct is 
exdsed fi^om its vector and inserted into an E. ooH expression vector, e.g. pET from New England 
BioLabs, Beverly, MA. A dialled description of this method is set forth below. 

A Smal to BamHI fi-agmmt of the OP-1 cDNA defined in SEQ. ID. NO. 29 (Ozkaynak st 
10 al. (1990) EMBO J. 9: 2085-2093) is cloned into Bhies(Tq>t KS+ (available firom Stratagene 
Cloning Systems, La JoUa, CA) previously cleaved with EcoRV and BamHI. Once transformed 
into E coli. the resulting colonies are screened by a blue-white sdection process wherein the 
desired colonies are blue. The correct done may be identified by restriction screening to give the 
following expected restriction fiagmrats. 



see for «ample, Kunkel, (1985) Proc. Natl. Acad. Sci. USA 822: 488; Kunkel etal. (1985) Meth. 
Enzvmol. 154: 367; and U.S. Patent No. 4,873,192, specific nucleotides are modified vathin the 
OP- 1 cDNA to introduce large and small loops within the OP-1 gene and to introduce unique 
restriction sites at the ends of the OP-1 gene. With all of the following modifications, there is no 
20 frame shift and, therefore, E, coU transformed with the mutagenesis products that give white 
colonies indicate an error in the sequence. 

In a first step, the large loop is introduced by oligonucleotide directed mutagenesis using the 
oligonucleotide set forth in SEQ. ID. NO. 34, The oligonucleotide for the large loop is designed 
so it that has N-terminal and C-terminal over lapping ends 15 bp long, and introduces a unique 
25 Bam HI restriction site which can be used to screen for incorporation. 

Then, the small loop is introduced by oligonucleotide directed mutagenesis using the 
oligonucleotide set forth in SEQ. ID. NO. 33. The oligonucleotide for the small loop is designed 
so it that has N-terminal and C-terminal overlapping ends 18 bp long, and introduces a unique 
Nhel restriction site which can be used to screen for incorporation. 



Restriction Enzyme 

EcoRI 

Xhol 



Fragment size fbp^ 
84, 789, 3425 
161, 1223, 2914 
97. 650, 3551 



SacII 



15 



Using synthetic oligonucleotides and either PGR or the site-directed mutagenesis methods. 
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Then, the C-tenninus of the OP-1 gene is modified by oligonudeotide directed mutagenesis 
using the oligonucleotide set forth in SEQ. ID. NO. 35. The C-terminus is modified to remove 
the native stop codon and introduce a NotI site for cloning into a vector usefiil for producing a 
phage display library, q)edfically pCANTAB S phagemid fi-om Pharmacia. The oligonucleotide 
5 designed for this purpose has N-terminal and C-terminal over lapping ends 14 bp long, and 
introduces both a unique NotI restriction ate and a unique SphI restriction site. Both restriction 
sites can be used to screen for incorporation. 

Then, the N-terminus of the OP-1 gene is modified by oligonucleotide directed mutagenesis 
using the oligonucleotide set forth in SEQ. ID. NO. 32. The N-terminus is modified to introduce 
10 a Sfil restriction site for cloning into the pCANTAB 5 phagemid and an Ndel restriction site for 
eventual cloning into an e?q>ression vector. The oligonucleotide designed for this purpose has N- 
tenninal and C-tmninal overlapping ends 22 bp long. Both the Sfil and the Ndel restriction sites 
can be used to screen for incorporation. 

The OP-1 cDNA resulting fi^om the previous four mutagenesis steps is set forth in SEQ. ID. 
1 5 NO. 37. In order to improve the efBdency of selecting OP-1 morphon constmcts that bind an 
OP-1 receptor, mutagenesis of the large and small loops are performed in independent reactions. 
To enable this step, the resulting cDNA (SEQ. ID. NO. 37) is digested with the restriction 
enzymes Sfil and NotI, and the 376 bp Sfil to NotI fragment is cloned directly into the 
pCANTAB 5 phagemid previously digested vyith the restriction enzymes Sfil and NotI. 

20 Mutations may be introduced into the small loop by oligonucleotide directed mutagenesis 

using oligonucleotides similar to those defined in SEQ. ID. Nos. 38 and 39. Mutations also may 
be introduced into the large loop by oligonucleotide directed mutagenesis using oligonucleotides 
similar to those set forth in SEQ. ID. Nos. 40, 41, 42, and 43. Oligonucleotides defined by SEQ. 
ID. Nos. 40 and 41 introduce changes in the first 20 base pairs defining the large loop while 

25 oligonucleotides defined by SEQ. ID. Nos. 42 and 43 introduce changes into the last half of the 
loop. 

Libraries of phage expressing on their virion surfaces proteins produced by each of the 
separate loop mutagenesis steps are screened for binding activity with OP-1 receptor using a 
protocol set forth in the instructions for use with the "Recombinant Phage Antibody System" 

30 (Pharmacia) . Briefly, the library of clones in pCANTAB 5 phagemid produced by each of the 
separate loop mutagenesis steps are transformed into competent E.coli TGI and grown in the 
presence of KG? helper phage. This step results in the production of phage expressing OP-1 
morphons on their cell surface. The resulting phage are screened for binding activity with OP-1 
receptor (Malpe eLal (1994) Biochem. Biophvs. Res. Comm. 201 : 1 140-1 147; tenDijke etal. 

35 (1994) J. Biol. Chem. 269: 16985-16988) previously immobilized on a solid surface, e.g., Affigel 
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15, Fractrogd, plastic surface, or a magnetic bead (Hennanson etal. in "Immobilized Affinity 
lieand Techniques" (1992) Academic Press, Inc, San Diego). Phage that fid! to bind the rec^tor 
are removed by washing exhaustively (more than 20 washes) with a buflFer containing phosphate 
buffered saline (PBS), followed by exhaustive washes (more than 20 washes) with PBS containing 
0.05% Triton X-100. Then, specific binding constructs are eluted fi-om the solid surface with a 
buffer containing 0.1 M gtydne, pH 3.0, and 0.05% Triton X-100 and the etuate neutralized with 
IM Tris-HCl pH8.0, 0.05 % Triton X-100. Phage having specific affinity for OP-1 are harvested. 
At this stage, however, the eluate may contain several different phage where each phage particle 
can bind OP-1 receptor. 

In order to isolate individual phage particles, E. coli then are reinfi^ed with the ehited 
phage, and infected colonies are titrated out to give individual colonies into microtiter plates. 
Then, the infected E. coli are harvested fi-om each well of the microtiter plate, infected with K07 
helper phage to reinduce the expression of the OP-1 morphon on the surface of the resulting 
phage, and the resulting mixture is added to individual wells of a second microtiter dish contaming 
surfece immobilized OP-1 receptor. Using this approach, individual phage can be screened for 
binding activity with immobilized OP-1 receptor. Binding is visualized with an horse radish 
peroxidase conjugated antibody having binding specificity for an antigen on the surface of the 
phage. The individual phage subsequently are eluted fi-om the OP-1 receptor bound to the surface 
of a well on the microtiter dish and the DNA isolated fi^om them. After sequencing, the DNA 
sequences encoding particular regions, for example, the small and large loops, that generate the 
high binding affinity are combined in a single morphon construct. 

Then, a linker is introduced into the C-terminus of the penultimate morphon gene by 
oligonucleotide directed mutagenesis using the oligonucleotide set forth in SEQ. ID. NO. 36. 
The linker introduces the non-suppressible stop codon TAA at the end of the morphon gene 
"...RACGSH*" and follows the stop codon with a unique Bglll site (AGATCT). Success in 
making this change is determined by the absence of exported phage (the termination codon blocks 
the production of gene 3 protein) and the ability to cleave plasmid DNA with Bglll. Then, the 
optimized OP- 1 morphon gene is excised with Ndel and Bglll and cloned into a pET vector 
(New England Biolabs, Beverly, MA) previously cleaved with Ndel and BamHI. 

Expression of the morphon gene to produce the morphon gene product is induced after the 
expression of T7 RNA polymerase (initiated through infected with XCE6 phage). During 
expression, the OP-1 morphon is produced in inclusion granules which are harvested fi-om the cell 
paste. The protein then is dissolved in 6M guanidine-HCl, 0.2M Tris-HCl, pH 8.2, and 0.1 M 2- 
mercaptoethanol and dialyzed exhaustively against 6M urea, 2.5 mM Tris-HCl, pH 7.5 and 1 mM 
EDTA. 2-mercaptoethanol is added to a final concentration of 0. 1 M and the solution incubated 
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at room temperature. Then, the mixture is dialyzed exhaustively against buffer containing 2.5 mM 
Tris-HCl, pH 7.5 and ImM EDTA. Folded OP-1 morphon then is puriiSed by afBnity 
diromatogr^hy on a cohmm packed with surface immobilized OP-1 recqnor. Unbound material 
is removed by washing as desoibed above and the specific OP-1 receptor binding material is 
5 ehited also as described above. 

Example n. Determination of Binding Activity of an OP-1 Based Morphon. 

Cells e)qpressing an OP-1 receptor on their cell surface are plated into 35 mm dishes and 
incubated for 4S hours in DMEM (Dulbecco*s modified Eagle medium) plus 10% fetal calf serum. 
Purified OP-1, or an OP-l-analog is iodinated with Na^^^I by chloramine T oxidation, preferably 

10 having a specific activity of about 50-100 mCi/mg, by following essentially the protocol of Frolik 
SLaL (1984) J. Biol. Chem. 595 : 10995-1 1000. Labeled OP-l, or OP-l-analog, then is purified 
using a standard procedure, e.g., by chromatographic separation. The plated ceDs then are 
washed twice with physiologically buffered saline in the presence of 0. 1% BSA, and incubated at 
22°C in the presence of BSA, buffer and labeled OP-1 (1 ng) and various concentrations (e.g., 0- 

15 lOmg/ml) of unlabelled competitor, e.g., unlabelled OP-1 or the candidate OP-1 based morphon. 
Following binding, the cells are washed three times with cold buffer, solubilized in 0.5 ml of 0.5 N 
NaOH, removed fi'om the dish, and radioactivity determined by gamma or scintillation counter. 
Data then are expressed as percent inhibition, where 100% inhibition of specific binding is the 
difference between binding in the absence of competitor and binding in the presence of a 100-fold 

20 molar excess of unlabelled competitor. Binding paramet^s prefi^-ably are determined u^ng a 

con4)uter program such as LIGAND (Munsun et al. (1980) Anal. Biochem. 107 : 220-259). Upon 
performance of the assay, it is contemplated that the OP-1 based morphon may have specific 
binding activity for the OP-1 receptor. Upon confirmation of binding activity, the morphon may 
be tested subsequently for biological activity. 

25 Example III. Determination of Biological Activity of an OP-1 Based Morphon. 

The biological activity of the resulting OP-1 based morphons may be determined using any 
of the using standard in vivo and in vitro assays, typically used for assessing native OP-1 activity. 
A variety of exemplary assays are set forth below. 

A. Mitoeenic Effect on Rat and Human Osteoblasts 

30 The following example is designed to demonstrate the ability of OP-1 morphons to induce 

proliferation of osteoblasts in vitro . It is contemplated that in this, and all the other examples 
involving osteoblast cultures, rat osteoblast-enriched primary cultures preferably are used. 
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Although these cultures are heterogeneous in that the individual cells are at different stages of 
differentiation, the culture is believed to more accurately reflect the metabolism and function of 
osteoblasts in vivo than osteoblast cultures obtained from established cell lines. Unless otherwise 
indicated, all chemicals referenced are standard, commerdally available reagents, readily available 
5 from a number of sources, induding Sigma Qiemica], Co., St. Louis; Calbiochem, Corp., San 
Diego and Aldrich Chemical Co., Milwaukee. 

Briefly, rat osteoblast-enriched primary cultures are prepared by sequential collagenase 
digestion of newborn rat calvaria (e.g., from 1-2 day-old animals, Long-Evans strain, Charies 
River Laboratories, Wilmington, MA), following standard procedures, such as are described, for 

1 0 example, in Wong eyd., (1 975) Proc. Natl. Acad. Sci. USA 72 : 3 1 67-3 171. Rat osteoblast single 
cell suspensions then are plated onto a multi-well plate (e.g., a 24 well plate at a concentration of 
50,000 osteoblasts per well) in alpha Ml^ (modified Eagle's medium, Gibco, Inc., Long Island) 
containing 10% FBS (fetal bovine serum), L^hitamine and penicillin/streptomydn. The cells are 
incubated for 24 hours at 37^C, at which time the growth medium is replaced with alpha MEM 

15 containing 1% FBS and the cells incubated for an additional 24 hours so that cdls are in serum- 
dq>rived growth medium at the time of the experiment. 

The cultured cells are divided into four groups: (1) wells which receive, for example, 0. 1, 
1.0, 10.0, 40.0 and 80.0 ng of OP-1 morphon, (2) wells whidi recdve 0.1, 1.0, 10.0 and 40.0 ng 
of wild type OP-1; (3) wells which receive 0.1, 1.0, 10.0, and 40.0 ng of TGF-P, and (4) the 

20 control group, which receive no growth factors. The cells then are incubated for an additional 
1 8 hours after which the wells are pulsed with 2roCiAvell of ^H-thymidine and incubated for sax 
more hours. The excess label then is washed ofif with a cold solution of 0.15 M NaCl and then 
250 ml of 10% tricboloracetic acid is added to each well and the wells incubated at room 
temperature for 30 minutes. The cells then are washed three times with cold distilled water, and 

25 lysed by the addition of 250 ml of 1% sodium dodecyl sulfate (SDS) for a period of 30 minutes at 
2JOQ resulting cell lysates are harvested using standard means and the incorporation of -^H- 
thymidine into cellular DNA determined by liquid scintillation as an indication of mitogenic 
activity of the cells. In the experiment, it is contemplated that the OP-1 morphon construct like 
natural OP-1 will stimulate ^H-thymidine incorporation into DNA, and therefore promote 

30 osteoblast cell proliferation. In contrast, the effect of the TGF-P is expected to be transient and 
biphasic. Furthermore, it is contemplated that at higher concentrations, TGF-P will have no 
significant effect on osteoblast cell proliferation. 



35 



The in vitro effect of the OP-1 morphon on osteoblast proliferation also may be evaluated 
using human primary osteoblasts (obtained from bone tissue of a normal adult patient and 
prepared as described above) and on human osteosarcoma-derived cell lines. 
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P. Progenitor C<?|| Stimulation. 

The following example is designed to demonstrate the ability of OP-1 morphons to stimulate 
the proliferation of mesendqraal progenitor cells. Useful naive stem cells include pluripotential 
stem cdls, which may be isolated from bone marrow or umbilical cord blood using conventional 
5 methodologies, (see, for example. Faradji gUl. (1988) Vox Sang . 55 (3): 133-138 or Broxmeyer 
fiUi. (1989) Proc. Natl Ac ad. Sci. USA, 86: 3828-3832), as weU as naive stem cells obtained 
from blood. Alternatively, embryonic cells (e.g., from a cultured mesodermal cell line) may be 
used. 

Another method for obtaining progenitor cells and for determining the ability of OP-1 
1 0 morphons to stimulate cell proliferation is to capture progenitor cdls from an in vivo source. For 
example, a biocompatible matrix material able to allow the influx of migratory progenitor cells 
may be miplanted at an in vivo site long enough to allow the influx of migratory progenitor cells. 
For example, a bone-derived, guanidine-extracted matrix, formulated as disclosed for example in 
Sampath etah (1983) Proc. Natl. Acad. Sci. USA 80: 6591-6595, or U.S. Patent No. 4,975,526, 
1 5 may be implanted into a rat at a subcutaneous ate, essentially following the metfiod of Sampath et 
al. After three days the implant is removed, and the progenitor cells associated with the matrix 
dispersed and cultured. 

Progenitor cells, however obtained, then are incubated in vitro v^th the candidate OP-1 
morphon under standard cell culture conditions, such as those described hereinbelow. In the 
20 absence of external stimuli, tiie progenitor cells do not, or only minimally, proliferate on their own 
in culture. It is contemplated, however, that progenitor cells cultured in the presence of a 
biologically active OP-1 morphon, like OP-1, will proliferate. Cell growtfi can be determined 
visually or spectrophotometrically using standard methods well known in the art. 

C. Morphoeen-Induced Cell Differentiation. 

25 A variety of assays may be used to determine OP-1 based morphon-induced cellular 

differentiation. 

f n Embryonic Mesenchyme Differentiation 

As with natural OP-1, it is contemplated that OP-1 morphons may be utihzed to induce cell 
differentiation. The ability of OP-1 morphons to induce cell differentiation may be demonstrated 
30 by culturing eariy mesenchymal cells in the presence of OP-1 morphon and then studying the 

histology of the cultured cells by staining with toluidine blue using standard cell culturing and cell 
staining methodologies well described in the art. For example, it is known that rat mesenchymal 
cells destined to become mandibular bone, when separated from the overlying epithelial cells at 
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stage 1 1 and cultured in vitro under standard tissue culture conditions, e.g., in a chemically 
defined, saiim-fi-ee medium, containing for example, 67% DMEM (Dulbecco's modified Eagle's 
medium), 22% F-12 medium, lOmM Hepes pH 7, 2mM glutamine, 50 mg/ml transferrin, 25 
mg/ml insulin, trace elements, 2mg/ml bovine seaim albumin coupled to oldc add, witfa HAT (0. 1 
5 oiM hypoxanthine, lOmM aminoptoin, 12 mM tlqrmidine, will not continue to dififerentiate. 

However, if these same cdls are left in contact with the overlying endodenn for an additional day, 
at wfaidi time they become stage 12 cells, they will continue to differentiate on their own m vitro 
to form chondrocytes. Further differentiation into osteoblasts and, ultimately, mandibular bone, 
requires an appropriate local environment, e.g., a vascularized environment. 

10 It is anticipated that, as with natural OP-1, stage 1 1 mesenchymal cells, cultured in vitro in 

the presence of OP- 1 morphon, e.g., 10-100 ng/ml, will continue to differentiate is vitro to form 
diondrocytes just as they continue to differentiate jfi vitro if they are cultured with the cell 
products harvested fi^om the overlying endodermal cells. This experiment may be performed with 
Afiferent mesenchymal cells to draionstrate the cell differentiation capability of 0P*1 morphon in 

IS different tissues. 

As another example of morphogen-induced cell differentiation, the ability of OP-1 morphons 
to induce osteoblast differentiation may be demonstrated in vitro using primary osteoblast 
cultures, or osteoblast-like cells lines, and assaying for a variety of bone cell markers that are 
specific markers for the differentiated osteoblast phenotype, e.g., alkaline phosphatase activity, 
20 parathyroid hormone-mediated cyclic AMP (cAMP) production, osteocalcin synthesis, and 
enhanced mineralization rates. 

(2) Alkaline Phosphatase Induction of Osteoblasts. 

Cultured osteoblasts in serum-fi'ee medium are incubated with, a range of OP-1 morphon 
concentrations, for example, 0. 1, 1 .0, 1 0.0, 40.0 or 80.0 ng OP-1 morphon/ml medium; or with a 

25 similar concentration range of natural OP-1 or TGF-P. After a 72 hour incubation the cell layer is 
extracted with 0.5 ml of 1% Triton X-IOO. The resultant cell extract, is centrifuged, and 100 ml 
of the extract is added to 90 ml of paranitroso-phenylphosphate (PNfPP)/glycerine mixture and 
incubated for 30 minutes in a 37°C water bath and the reaction stopped with 100 ml NaOH, The 
samples then are run through a plate reader (e.g., Dynatech MR700 plate reader, and absorbance 

30 measured at 400 nm, using p-nitrophenol as a standard) to determine the presence and amount of 
alkaUne phosphate activity. Protein concentrations are determined by the BioRad method. 
Alkaline phosphatase activity is calculated in units/mg protein, where 1 unit=I nmol p-nitrophenol 
liberated/30 minutes at 37X. 
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It is contemplated that the 0P*1 morpbon, like natural OP- 1 alone, will stimulate the 
production of alkaline phosphatase in osteoblasts thereby promoting the growth and e7q)ression of 
fhe osteoblast differentiated phenotype. 

The long term effect of OP- 1 morphon on the production of alkaline phosphatase by rat 
S osteoblasts also may be demonstrated as follows. 

Rat osteoblasts are prepared and cultured in multi-well plates as described above. In this 
exanq>Ie six sets of 24 wdl plates are plated with S0,000 rat osteoblasts per wdl. The wdls in 
each plate, prepared as described above, then are divided into three groups: (1) those which 
receive, for example, 1 ng of OP-1 morphon per ml of medhim; (2) those whidi recdve 40 ng of 

10 OP-1 morphon per ml of medium; and (3) those which received 80 ng of OP-1 morphon per ml of 
medium. Each plate then is incubated for different lengths of time: 0 hours (control time), 
24 hours, 48 hours, 96 hours, 120 hours and 144 hours. After each incubation period, the cell 
layer is extracted with 0.5 ml of 1% Triton X-1 00. The resultant cell extract is centrifuged, and 
alkaline phosphatase activity determined using paranitroso-phmylphosphate (PNPP), as above. It 

IS is contemplated that the OP- 1 morphon, like natural OP- 1 , will stimulate the production of 
alkaline phosphatase in osteoblasts in dose-dependent manner so that increasing doses of OP- 1 
morphon will further increase the level of alkaline phosphatase production. Moreover, it is 
contmplated that the OP-1 morphon-stimulated elevated levels of alkaline phosphatase in the 
treated osteoblasts will last for an tended period of time. 

20 £3} Induction of PTH-Mediated cAMP . 

This experiment is designed to test the effect of OP- 1 morphons on parathyroid hormone- 
mediated cAMP production in rat osteoblasts in vitro . Briefly, rat osteoblasts are prepared and 
cultured in a multiwell plate as described above. The cultured cells then are divided into four 
groups: (1) wells which receive, for example, 1.0, 10.0 and 40.0 ng OP-1 morphon/ml medium); 
(2) wells which receive for example, natural OP-1, at similar concentration ranges; (3) wells 
which receive for example, TGF-P, at similar concentration ranges; and (4) a control group which 
receives no growth factors. The plate then is incubated for another 72 hours. At the end of the 
72 hours the cells are treated with medium containing 0.5% bovine serum albumin (BSA) and 
ImM 3-isobutyl-l-methylxanthine for 20 minutes followed by the addition into half of the wells of 
human recombinant parathyroid hormone (hPTH, Sigma, St. Louis) at a concentration of 
200 ng/ml for 10 minutes. The cell layer then is extracted from each well wth 0.5 ml of 1% 
Triton X-1 00. The cAMP levels then are determined using a radioimmunoassay kit (e.g., 
Amersham, Arlington Heights, Illinois). It is contemplated that OP-1 morphon alone, like OP-1, 
will stimulate an increase in the PTH-mediated cAMP response, thereby promoting the growth 
and expression of the osteoblast differentiated phenotype. 



25 
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£4) Induction of Osteocalcin Productinn 

Osteocaldn is a bone-specific protein syntheazed by osteoblasts which plays an integral role 
in the rate of bone mineralization m vivo. Circulating levels of osteocalcin in serum are used as a 
marker for osteoblast activity and bone formation m vivo . Induction of osteocalcin synthesis in 
5 osteoblast-enriched cultures can be used to demonstrate OP-1 morphon efficacy in vitro . 

Rat osteoblasts are prepared and cultured in a multi-well plate as above. In this experimot 
the medmm is supplemetited with 1 0%FBS, and on day 2, cells are fed with fresh medhmi 
supplemrated with fresh 10 mM 3-glycerophosphate (Sigma, Inc.). Beginning on day 5 and twice 
weekly thereafter, cdls are fed with a con^lete mineralization medhim containing all of the above 

1 0 componmts plus fresh L(+)-ascorbate, at a final concentration of SOmg/ml medhmi. OP-1 

morphon then is added to the wells directly, e.g., in 50% acetonitrile (or 50% ethanol) containing 
0. 1% trifluoroacetic acid (TFA), at no more than 5ml morphon/ml medium. Control wells recdve 
solvent vehicle only. The cells then are re-fed and the conditioned medium sample dihited 1 : 1 in 
standard radioimmunoassay buffer containing standard protease inhibitors and stored at -20T 

1 5 until assayed for osteocalcin. Osteocalcin synthesis is measured by standard radioimmunoassay 
using a conunercially available osteocaldn-specific antibody. 

Mineralization is determined on long term cultures (1 3 day) using a modified von Kossa 
staining technique on fixed ceU layers: cells are fixed in fresh 4% paraformaldehyde at 23 X for 
10 min, following rinsing cold 0.9% NaCl. Fixed cells then are stained for endogenous alkaline 
20 phosphatase at pH 9.5 for 10 min, using a commercially available kit (Sigma, Inc.). Purple stained 
cells then are dehydrated with methanol and air dried. Aftw 30 min incubation in 3% AgN03 in 
the dark, H20-rinsed samples are exposed for 30 sec to 254 nm UV light to develop the black 
silver-stained phosphate nodules. Individual mineralized foci (at least 20 mm in size) are counted 
under a dissecting microscope and expressed as nodules/culture. 

25 It is contemplated that the OP-1 morphon, like natural OP-1 , will stimulate osteocalcin 

synthesis in osteoblast cultures. Furthermore, it is contemplated that the increased osteocalcin 
synthesis in response to OP-1 morphon will be in a dose dependent manner thereby showing a 
significant increase over the basal level after 13 days of incubation. The enhanced osteocalcin 
synthesis also may be confirmed by detecting the elevated osteocalcin mRNA message (20-fold 

30 increase) using a rat osteocalcin-specific probe. In addition, the increase in osteoclacin synthesis 
correlates with increased mineralization in long term osteoblast cultures as determined by the 
appearance of mineral nodules. It is contemplated also that OP-1 morphon, like natural OP-1, will 
increase significantly the initial mineralization rate as compared to untreated cultures. 
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(5) Morphogen-Induced CAM Ejcpression 

Members of the vg/dpp subgroup induce CAM expression, particuiarly N-CAM expression, 
as part of their induction of morphogenesis (see copending USSN 922,8 13). CAMs are 
morphoregulatory molecules identified in all tissues as an essential step in tissue development. N- 
5 CAMs, which con^jrise at least 3 isoforms (N-CAM-180, N-CAM-140 and N-CAM-120. where 
"ISO", "140** and "HO** indicate the apparent molecular weights of the isoforms as measured by 
SDS polyacrylamide gel electrophoresis) are expressed at least transiently in developing tissues, 
and permanently in nerve tissue. Both the N-CAM-180 and N-CAM-140 isoforms are expressed 
in both developing and aduh tissue. The N-CAM-120 isoform is found only in adult tissue. 
10 Another neural CAM is LI. 

The ability of OP-1 based morphons to stimulate CAM expression may be demonstrated 
using the following protocol, using NG 108-1 5 cells. NG 108-1 5 is a transformed hybrid cell line 
(neuroblastoma x glioma, America Type Tissue Culture (ATCC), Rockville, MD), ^chibiting a 
morphology characteristic of transformed embiyonic neurons. As described in Example D, 
15 below, untreated NG108-15 cells exhibit a fibroblastic, or minimally differentiated, morphology 
and express only the 180 and 140 isoforms of N-CAM normally associated with a developing cell. 
Following treatment with members of the vg/dpp subgroup these cells exhibit a morphology 
charaaeristic of adult neurons and express enhanced levels of all three N-CAM isoforms. 

In this example, NGI08-I5 cells are cultured for 4 days in the presence of increasing 
20 concentrations of either the OP-1 morphon or natural OP-1 using standard culturing procedures, 
and standard Western blots performed on whole cell extracts. N-CAM isoforms are detected with 
an antibody which crossreacts with all three isoforms, mAb H28. 123, obtained fi-om Sigma 
Chemical Co., St. Louis, the different isoforms being distinguishable by their differoit mobilities 
on an electrophoresis gel. Control NGI 08-15 cells (untreated) express both the 140 kDa and the 
25 1 80 kDa isoforms, but not the 120 kDa, as determined by Western blot analyses using up to 

100 mg of protein. It is contemplated that treatment of NGI 08- 15 cells with OP-1 morphon, like 
natural OP-1 may result in a dose-dependent increase in the expression of the 180 kDa and 140 
kDa isoforms, as well as the induction of the 120 kDa isoform induced. In addition, it is 
contemplated that the OP-1 morphon, like natural OP-1 -induced CAM expression may correlate 
30 with cell aggregation, as determined by histology. 

fD) OP-1 Morphon-Induced Redifferentiation of Transformed Phenotyp e 



It is contemplated that OP-1 morphon, like natural OP-1, also induces redifferentiation of 
transformed cells to a morphology characteristic of untransformed cells. The examples provided 
below detail morphogen-induced redifferentiation of a transformed human cell line of neuronal 
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origin (NG108-IS); as well as mouse neuroblastoma cells (NlE-l IS), and human etobryo 
carcinoma cells, to a morphology characteristic of untransformed cells. 

As described above, NGl 08- 1 S is a transformed hybrid cell line produced by fu^g 
neuroblastoma x glioma cells (obtained from ATTC, RockviUe, MD)» and e^1)iting a 
5 morphology characteristic of transformed embryonic neurons, e.g., having a fibroblastic 

morphology. Specifically, the celb have polygonal cell bodies, short, spike-like processes and 
make few contacts with neighboring cells. Incubation of NGl 08- 15 cells, cultured in a chemically 
defined, serum-free medium, with 0.1 to 300 ng/ml of OP-1 morphon natural or OP-1 for four 
hours induces an orderiy, dose-dependent change in cell morphology. 

10 For example, NG108-I5 cells are subcultured on poly-L-lysine coated 6 well pl'^tes. Each 

well contmis 40-50,000 cells in 2.5 ml of chemically defined medium. On the third day, 2.5 ml of 
OP-1 morphon or natural OP-I in 60% ethanol containing 0.025% trifluoroacetic is added to eadn 
well. The media is changed daily with new aliquots of morphogen. It is contemplated that OP-1 
morphon Uke OP-1 may induce a dose-dependent redifierentiation of the transformed cells, 

15 inchiding a rounding of the soma, an increase in phase brightness, extension of the short neurite 
processes, and other significant changes in the cellular ultrastructure. After several days it is 
contemplated also that treated ceUs may begin to form epithelioid sheets that then become highly 
packed, multi-layered aggregates, as determined visually by microscopic examination. 

Moreover, it is contemplated that the rediflferentiation may occur without any associated 
20 . changes in DNA synthesis, cell division, or cell viability, making it unlikely that the morphologic 
changes are secondary to cell differentiation or a toxic effect of the morphogen. In addition, it is 
contemplated that the morphon-induced redifierentiation may not inhibit cell division, as 
determined by ^H-thymidine uptake, unlike other molecules such as butyrate, DMSO, retinoic 
acid or Forskolin which have been shown to stimulate differentiation of transformed cells, in 
25 analogous experiments. Thus, it is contemplated that the OP-1 morphon like natural OP-1 may 
maintain cell stability and viabiUty after inducing redifierentiation. 

The morphogen described herein would, therefore, provide useful therapeutic agents for the 
treatment of neoplasias and neoplastic lesions of the nervous system, particularly in the treatment 
of neuroblastomas, including retinoblastomas, and gliomas. 

30 (E) Maintenance of Phenotyp e. 

OP-1 morphons, like natural OP-1 also may be used to maintain a cell's diflFerentiated 
phenotype. This application is particularly useful for inducing the continued expression of 
phenotype in senescent or quiescent cells. 
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(1) In Vitro Model for Phenotvpic Maintenance 

The phenotypic maintenance csq>ability of morphogens is determined readily. A number of 
differentiated cells become senescent or quiescent after multiple passages in vitro under standard 
tissue culture conditions well desoibed in the art (e.g., Culture of A nimal rrffg A Manual of 
5 Basic Techniques. C.R. Freshney, ed., Wiley, 1987). However, if these cells are cultivated in 
vitro in association with a morphogen sudi as OP* I, cells are stimulated to maintain expression of 
their phenotype through multiple passages. For example, the alkaline phosphatase activity of 
cultured osteoblasts, such as cultured osteosarcoma cells and calvaria cells, is ^gnificantly 
reduced after multiple passages in vitro . However, if the cells are cultivated in the presence of 

10 OP-1 alkaline phosphatase activity is maintained over extmded periods of time. Similariy, 
phenotypic expression of myocytes also is maintained in the presence of a morphogen. In the 
experiment^ osteoblasts are cultured as described in Example A The cells are divided into 
groups, incubated with vaiying concoitrations of dther OP-1 morphon or natural OP-1 (e.g., 0- 
300 ng/ml) and passaged multiple times (e.g., 3-5 times) using standard methodology. Passaged 

15 cells then are tested for alkaline phosphatase activity, as described in Example C as an indication 
of differentiated cell metabolic function. It is contemplated that osteoblasts cultured in the 
absence of OP-1 morphon, like natural OP-1, may have reduced alkaline phosphatase activity, as 
compared to OP-1 morphon, or natural OP-1 -treated cells. 

(2) In Vivo Model for Phenotvnic Maintenance 

20 Phenotypic maintenance capability also may be demonstrated in vivo, using a standard rat 

model for osteoporosis. Long Evans female rats (Charles River Laboratories, Wilmington, MA) 
are Sham-operated (control animals) or ovariectomized using standard surgical techniques, to 
produce an osteoporotic condition resulting from decreased estrogen production. Following 
surgery, e.g., 200 days after ovariectomy, rats are systemically provided with phosphate buffered 

25 saline (PBS) or morphogen, (e.g., OP-1 morphon, or natural OP-1, 1-100 mg) for 21 days (e.g., 
by daily tail vein injection.) The rats then are sacrificed and serum alkaline phosphatase levels, 
serum calcium levels, and serum osteocalcin levels are determined, using standard methodologies 
as described therein and above. It is contemplated that the OP-1 morphon treated rats, like the 
OP- 1 treated rats may exhibit elevated levels of osteocalcin and alkaline phosphatase activity. It 

30 is contemplated also that histomorphometric analysis on the tibial diasypheal bone may show 
improved bone mass in OP-1 morphon treated animals as compared with untreated, 
ovariectomized rats. 
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F. Proliferation of Progenitor Cell Populations 

Progenitor cells may be stimulated to proliferate in vivo or ex vivo. It is contemplated that 
cells may be stimulated in vivo by injecting or otherwise providing a sterile preparation containiiig 
the OP-I morphon into the individual. For example, the hemopoietic pluripotential stem cell 
S population of an individual may be stimulated to proliferate by injecting or otherwise providing an 
appropriate concentration of OP-1 morphon to the individual's bone marrow. 

Progenitor cells may be stimulated ec vivo by contacting progenitor cells of the population 
to be enhanced with a morphogoiically active OP-I morphon under stmle conditions at a 
concratration and for a time sufficient to stimukite proliferation of the cells. Suitable 

10 concentrations and stimulation times may be determined empirically, essentially following the 

procedure described in Example A, above. It is contemplated that a OP-1 morphon concentration 
of between about 0. 1-100 ng/ml and a stimulation period of from about 10 minutes to about 
72 hours, or, more generally, about 24 hours, typically should be sufficient to stimulate a cell 
population of about 10^ to 10^ cells. The stimulated cells then may be provided to the individual 

15 as, for example, by injecting the cells to an appropriate in vivo locus. Suitable biocompatible 
progenitor cells may be obtained by any of the methods known in the art or described 
hereinabove. 

G. Regeneration ofPamaged or Diseased Tissue 

It is contemplated that OP-1 morphons may be used to repair diseased or damaged 
20 mammalian tissue. The tissue to be repaired preferably is assessed first, and excess necrotic or 
interfering scar tissue removed as needed, e.g., by ablation or by surgical, chemical, or other 
methods known in the medical arts. 

OP-1 morphon then may be provided directly to the tissue locus as part of a sterile, 
biocompatible composition, either by surgical implantation or injection. The morphon also may 

25 be provided systemically, as by oral or parenteral administration. Alternatively, a sterile, 

biocompatible composition containing progenitor cells stimulated by a morphogenically active 
OP-1 morphon may be provided to the tissue locus. The existing tissue at the locus, whether 
diseased or damaged, provides the appropriate matrix to allow the proliferation and tissue-specific 
differentiation of progenitor cells. In addition, a damaged or diseased tissue locus, particularly 

30 one that has been further assaulted by surgical means, provides a morphogenically permissive 
environment. Systemic provision of OP-1 morphon may be sufficient for certain applications 
(e.g., in the treatment of osteoporosis and other disorders of the bone remodeling cycle, as an 
example). 
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In some drcumstances, particularly i^ere tissue damage is extensive, tiie tissue may not be 
capable of providing a suflSdent matrix for cell influx and proliferation. In these instances, it may 
be necessaiy to provide progenitor cells stimulated by the OP-1 morphon to the tissue locus in 
association with a suitable, bioconQ»tible, formulated matrix, prepared by any of the means 
S desoibed below. The matrix preferably is in vivo biodegradable. The matrbc also may be tissue- 
specific and/or may comprise porous particles having dunensions within the range of 70-850mm, 
most preferably 1 50-420mnL 

OP- 1 morphon also may be used to prevent or substantially inhibit immune/inflammatory 
response-mediated tissue danuige and scar tissue formation following an injury. OP-1 morphon 

10 may be provided to a newly injured tissue locus, to induce tissue morphogenesis at the locus, 
preventing the aggregation of migrating fibroblasts into non-differentiated connective tissue. 
Preferably the OP-1 morphon may be provided as a sterile pharmaceutical prqparation injected 
into the tissue locus within five hours of the injury. Where an immune/inflammatory response is 
unavoidably or deliberately induced, as part of^ for example, a surgical or other aggressive dinical 

1 5 therapy, OP-1 morphon preferably may be provided prophylactically to the patient, prior to, or 
concomitant with, the therapy. 

Below are several samples, describmg protocols for demonstrating OP-1 morphon-induced 
tissue morphogenesis in bone, liver, nerve, dentin, cementum and periodontal tissue. 

(1) OP-1 Morphon-induced Bone Morphogenesis. 

20 A particularly usefol manmialian tissue model system for demonstrating and evaluating the 

morphogenic activity of a protein is the endochondral bone tissue morphogmesis model known in 
the art and described, for example, in U.S. Pat. No. 4,968,590 and incorporated herein by 
reference. The ability to induce endochondral bone formation includes the ability to induce 
proliferation and differentiation of progenitor cells into chondroblasts and osteoblasts, the ability 

25 to induce cartilage matrix formation, cartilage calcification, and bone remodeling, and the ability 
to induce formation of an appropriate vascular supply and hematopoeitic bone marrow 
differentiation. 

The local environment in which the morphogenic material is placed is important for tissue 
morphogenesis. As used herein, "local environment" is understood to include the tissue structural 
30 matrix and the environment surrounding the tissue. For example, in addition to needing an 

appropriate anchoring substratum for their proliferation, the cells stimulated by morphogens need 
signals to direct the tissue-specificity of their differentiation. These signals vary for the different 
tissues and may include cell surface markers. In addition, vascularization of new tissue requires a 
local environment which supports vascularization. 
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The foUoiving sets forth various procedures for evahiating the in vivo morphogenic utility of 
OP-1 morphon and OP-1 morphon containing compositions. The compositions may be injected 
or surgically 'naplanied in a mammal, following any of a numbo- of procedures well known in the 
an. For example, surgical unplant bioassays may be performed essmtially fbUowing the 
procedure of San^ath al. (1983) Proc. Natl. Acad. Sd USA 80 6591^595 and U.S. Pat 
No. 4,968,590. 

Histological sectioning and staining is preferred to determine the extent of morphogenesis in 
vivo, particularly m tissue repair procedures. Excised implants are fixed in Bouins Solution, 
onbedded in paraffin, and cut into 6-8 mm sections. Staining with tohiidine blue or 
hemotoxylin/eosin demonstrates clearly the ultimate development of the new tissue. Twdve day 
implants are usually sufficient to determine whether the implants contain newly induced tissue. 

Successful implants exhibit a controlled progression through the stages of induced tissue 
development allowing one to identify and follow the tissue-specific events that occur. For 
example, in endochondral bone formation the stages include: (1) leukocytes on day one; 
(2) mesenchymal cell migration and proliferation on days two and three; (3) chondrocyte 
q)pearance on days five and six; (4) cartilage matrix formation on day seven; (5) cartilage 
calcification on day eight; (6) vascular invasion, ^ipearance of osteoblasts, and formation of new 
bone on days nine and ten; (7) appearance of osteoclastic cells, and the commencement of bone 
remodeling and dissolution of the implanted matrix on days twelve to eighteen; and 
(8) hematopoietic bone marrow dififerentiation in the resulting ossides on day twenty*one. 

In addition to histological evaluation, biological markers may be used as markers for tissue 
morphogenesis. Usefiil markers include tissue-specific enzymes whose activities may be assayed 
(e.g., spectrophotometrically) after homogenization of the implant. These assays may be useful 
for quantitation and for rapidly obtaining an estimate of tissue formation after the implants are 
removed firom the animal. For example, alkaline phosphatase activity may be used as a marker for 
osteogenesis. 

Incorporation of systemically provided OP-1 morphon may be foUowed using labelled 
protein (e.g., radioactively labelled) and determining its localization in the new tissue, and/or by 
monitoring their disappearance from the circulatory system using a standard labeling protocol and 
pulse-chase procedure. OP-1 morphon also may be provided with a tissue-specific molecular tag, 
whose uptake may be monitored and correlated with the concentration of OP-1 morphon 
, provided. As an example, ovary removal in female rats results in reduced bone alkaline 
phosphatase activity, and renders the rats predisposed to osteoporosis (as described in 
Example E). If the female rats now are provided with OP-1 morphon, a reduction in the systemic 
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concentration of caldum may be seen, which corrdates with the presence of the provided OP-1 
morpbon and which is anticipated to correspond with increased alkaline phosphatase activity. 

(2) OP-1 Morphon-Induced Liver Regeneration. 

As another example, a method for inducing morphogenesis of substantially iiyured liver 
5 tissue following a partial hepatectomy utilizing OP-1 morphon is presented. Variations on this 
general protocol may be used to test morphogen activity of OP-1 morphon in other different 
tissues. The general method involves excising an essentially nonregenmting portion of a tissue 
and providing OP-1 morphon, preferably as a soluble pharmaceutical preparation to the excised 
tissue locus, dosing the wound, and examining the site at a future date. Like bone, liver has a 
10 potential to regraerate upon injury during post-fetal life. 

OP-1 morphon, e.g., 1 mg/ml, in a bioconq)atible solution, for example, a purified OP-1 
morphon, is solubilized in 50% ethanol, or compatible solvent, containing 0. 1% trifiuoroacetic 
add, or conqiatible acid. The injectable OP-1 morphon solution is prepared, e.g., by dihiting one 
vohmie of OP-1 morphon solvent-add stock solution with 9 volumes of 0.2% rat serum albumin 
IS in sterile PBS (phosphate-buffered saline). 

In the experiment, growing rats or aged rats (e.g.. Long Evans, Charles River Laboratories, 
Wilmington) are anesthetized by using ketamine. Two of the liver lobes (left and right) are cut 
out (approximately 1/3 of the lobe) and the OP-1 morphon is injected locally at multiple sites 
along the cut ends. The amount of OP-1 morphon injected may be, e.g., 20-100 ^g in 100-1000 
20 ^1 PBS/RSA (phosphate buffered sdine/rat serum albumin) injection buffer. Placebo samples 

comprise injection buffer only. In experimental essays, five rats in each group preferably are used. 
The wound is closed and the rats are allowed to eat normal food and drink tap water. 

After 12 days, the rats are sacrificed and liver regeneration is observed visually, to evaluate 
the effects of the OP-1 morphon on liver regeneration most effectively. It is contemplated that 
25 the OP- 1 morphon injected group may show complete liver tissue regeneration with no sign 

remaining of any cut in the liver. By contrast, the control group into which only PBS is injected, 
show tjrpically only minimal regeneration with the incision remaining in the sample. 

(3) OP-1 Morphon-Induced Dentin. Cementum and Periodontal Ligament Regeneration. 

As still another example, the ability of OP-1 morphons to induce dentinogenesis also may be 
30 demonstrated. To date, the unpredictable response of dental pulp tissue to injury is a basic 

clinical problem in dentistry. Cynomolgus monkeys are chosen as primate models as monkeys are 
presumed to be more indicative of human dental biology than models based on lower non-primate 
mammals. 
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Using standard dental surgical procedures, small areas (e.g., 2nim) of dental pulps are 
surgically exposed by ronoving the enamd and dentin immediately above the pulp (by drilling) of 
sample teeth, performing a partial amputation of the coronal pulp tissue, indudng hcmostasis, 
application of the pulp treatmrat, and sealing and filling the cavity by standard procedures. 

S Pulp treatments used may inchide: an OP-1 morphon dispersed in a carrier matrix; carrier 

matrix alone, and no treatment. Twdve teeth per animal (four for each treatment) are prepared, 
and two ammals are used. At four weeks, teeth are extracted and processed histologically for 
analysis of dentin formation, and/or ground to analyze dentin mineralization. The effect of OP-1 
morphon on osteodentin reparation may be observed visually by comparing with the control 
10 8anq)les. It is contemplated that the OP-1 morphon, like natural OP-1, plus a carrier matrix may 
induce formation of reparative, osteodentin bridges, traversing on surgically e9q>osed healthy 
dCTtal pulps. By contrast, pulps treated with carrier matrix alone, do not form reparative dentin. 

(4) OP-1 Morphon-Induced Nerve Tissue Repair 

As yet another example, the induction of regenerative effects on central nervous system 
1 5 (CNS) repair, by a morphogenically active QP-l morphon, may be demonstrated using a rat brain 
stab model. Briefly, male Long Evans rats are anesthetized and the head area prepared for 
surgery. The calvariae is exposed using standard surgical procedures and a hole drilled toward 
the center of each lobe using a 0.03 5K wire, just piercing the calvariae. 25ml solutions containing 
either OP-1 morphon (25mg), natural OP-1 (25mg) or PBS then is provided to each of the holes 
20 . by Hamilton syringe. Solutions are delivered to a depth approximately 3 mm below the surface, 
into the underlying cortex, corpus callosum and hippocampus. The skin then is sutured and the 
animal allowed to recover. 

Three days post surgery, rats are sacrificed by decapitation and their brains processed for 
sectioning. Scar tissue formation is evaluated by immunofluoresence staining for glial fibrillary 
25 acidic protein, a marker protein for glial scarring, to qualitatively determine the degree of scar 
formation. It is contemplated that the OP- 1 morphon, like natural OP-1, may result in reduced 
levels of glial fibrillary acidic protein in the tissue sections of animals treated with OP-1 morphon, 
evidencing the ability of the morphogen to inhibit glial scar formation, thereby stimulating nerve 
regeneration. 

30 The ability of OP-1 morphon to stimulate peripheral nervous system axonal growth over 

extended distances may be demonstrated using the following model. Neurons of the peripheral 
nervous system can sprout new processes on their own following injury, but without guidance 
these sproutings typically fail to cormect appropriately and die. Where the break is extensive, e.g., 
greater than 5 or 10 nmi, regeneration is poor or nonexistent. Previous experiments with OP-1, 
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show that morphogens stimulate peripheral nervous system axonal growth over ejctended 
distances, allowing repair and r^eneration of damaged peripheral neural pathways. 

In this example OP-1 morphon stimulation of nerve regeneration may be demonstrated iisii^ 
the rat sciatic nerve model The rat sciatic nerve can regenerate spontaneously across a 5 mm 
5 gap, and occasionally across a 10 mm gap, provided that the severed ends are inserted in a saline- 
filled nerve guidance channel In this e)q>eriment, n^e regeneration across at least a 12mm gap 
is tested. 

Adult female Sprague-Dai^ey rats (Charies River Laboratories, Inc.) wrighing 230-250 g 
are anesthetized with intraperitoneal injections of sodium pentobarbital (35 mg/kg body weight). 

1 0 A skin incision is made parallel and just posterior to the femur. The avascular intermuscular plane 
between vastus lateralis and hamstring muscles are entered and followed to the loose fibroareolar 
tissue surrounding the sciatic nerve. The loose tissue is divided longitudinally thereby fi^ng the 
sciatic nerve over its full extent without devascularizing any portion. Under a surgical microscope 
the sciatic nerves are transected with microscissors at mid*thigh and grafted with an OP-1 

1 5 morphon gel graft that separates the nerve stumps by 12 mm. The graft region is encased in a 
alicone tube 20 mm in length with a 1 .5 mm inner diameter, the interior of \n*ich is filled with the 
morphon solution. Specifically, the central 12 mm of the tube consists of an OP-1 morphon gel 
prepared by mfadng I to 5 mg of substantially pure OP-1 morphon produced with approximately 
100 ml of MATRIGEL™ (firom Collaborative Research, Inc., Bedford, MA), an exmicellular 

20 matrix extract derived from mouse sarcoma tissue, and containing solubilized tissue basement 
membrane, including laminin, type IV collagen, heparin sulfate, proteoglycan and entactin, in 
phosphate-buffered saline. The morphon-filled tube then is implanted directly into the defect site, 
allowing 4 mm on each end to insert the nerve stumps. Each stump is abutted against the 
morphon gel and is secured in the silicone tube by three stitches of commercially available surgical 

25 10-0 nylon through the epineurium, the fascicle protective sheath. 

In addition to OP-1 morphon gel grafts, control grafts of empty silicone tubes, silicone tubes 
filled with gel only and "reverse" autografts, wherein 12 mm transected segments of the animal's 
sciatic nerve are rotated 180*" prior to suturing, preferably also are grafted. All experiments 
preferably are performed in quadruplicate. All wounds preferably are closed by wound clips that 
30 are removed after 1 0 days. Rats can be grafted on both legs. At 3 weeks the animals are 

sacrificed, and the grafted segments removed and frozen on dry ice immediately. Frozen sections 
then are cut throughout the graft site, and examined for axonal regeneration by 
immunofluorescent staining using anti-neurofilament antibodies labeled with flurocein (obtained, 
for example, from Sigma Chemical Co., St. Louis). 
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It is contemplated that regeneration of the sciatic nerve may occur across the entire 12 mm 
distance in all graft sites when the g^ is filled with the OP-1 morphon gel. By contrast, empty 
silicone tubes, gel alone and reverse autografts do not show nerve regeneration. 

The inv^ition may be embodied in other specific forms without departing fi'om the spint and 
scope thereof Accordingly, other embodiments are within the following claims. 
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SEOOENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: CREATIVE BICMOLECOLES, INC 

(B) STREET: 45 SOUTH STREET 

(C) CITY: HOPKINTON 

(D) STATE: MA 

(E) COUNTRY: USA 

(F) POSJAL CODE (ZIP) : 01748 

(G) TELEPHONE: 1-508-435-9001 

(H) TELEFAX: 1-508-435-0454 

(I) TELEX: 

(ii) TITLE OF INVENTION: SINGLE-CHAIN ANALOGS OF TGF-B 
SUPERFAMILY (MORPHONS) 

(iii) NUMBER OF SEQUENCES: 43 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: PATENT AMINISTRATOR; TESTA, HURWITZ & 

THIBEAULT 

(B) STREET: 125 HIGH STREET, HIGH STREET TOWER 

(C) CITY: BOSTON 
<D) STATE: MA 

(E) COUNTRY: USA 

(F) ZIP: 02210 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC cotT^atible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patent In Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: PITCHER ESQ, EDMUND R 

(B) REGISTRATION NUMBER: 27,829 

(C) REFERENCE/DOCKET NUMBER: CRP-080 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 617-248-7000 

(B) TELEFAX: 617-248-7100 



(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 98 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(ix) FEATURE: 
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(A) NAME/KEY: Protein 
<B) lOCATION: 1..98 

(D) OTHER INFORMATION: /note» "TGP-Bl SEC3UENCE" 
(xi) SEQUENCE DESCRIPTICXf : SEQ ID N0:1: 

Cys Cys Val Arg Gin Leu Tyr lie Asp Phe Arg Lys Asp Leu Gly Trp 
15 10 15 

Lys Trp lie His Glu Pro Lys Gly Tyr His Ala Asn Phe Cys Leu Gly 
20 25 30 

Pro Cys Pro Tyr lie Trp Ser Leu Asp Thr Gin Tyr Ser Lys Val Leu 
35 40 45 

Ala Leu Tyr Asn Gin His Asn Pro Gly Ala Ser Ala Ala Pro Cys Cys 
50 55 60 

Val Pro Gin Ala Leu Glu Pro Leu Pro He Val Tyr Tyr Val Gly Arg 
65 70 75 60 

Lys Pro Lys Val Glu Gin Leu Ser Asn Met He Val Arg Ser Cys Lys 
85 90 95 

Cys Ser 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 98 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNBSS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..98 

(D) OTHER INFORMATION: /note= "TGF-B2 SEQUENCE" 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:2: 

Cys Cys Leu Arg Pro Leu Tyr He Asp Phe Lys Arg Asp Leu Gly Trp 
15 10 15 

Lys Trp He His Glu Pro Lys Gly Tyr Asn Ala Asn Phe Cys Ala Gly 
20 25 30 

Ala Cys Pro Tyr Leu Trp Ser Ser Asp Thr Gin His Ser Arg Val Leu 
35 40 45 

Ser Leu Tyr Asn Thr He Asn Pro Glu Ala Ser Ala Ser Pro Cys Cys 
50 55 60 

Val Ser Gin Asp Leu Glu Pro Leu Thr He Leu Tyr Tyr He Gly Lys 
65 70 75 80 
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Thr Pro Lys lie Glu Gin Leu Ser Asn Met lie Val Lys Ser Cys Lys 
B5 90 95 



Cys Ser 



(2) INFORMATION FOR SEQ ID N0:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 98 amino acids 

(B) TYTE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..98 

(D) OTHER INFORMATION: /note» "TGF-B3 SEQUENCE" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Cys Cys Val Arg Pro Leu Tyr lie Asp Phe Arg Gin Asp Leu Gly Glu 
15 10 15 

Lys Trp Val His Glu Pro Lys Gly lyr Tyr Ala Asn Phe Cys Ser Gly 
20 25 30 

Pro Cys Pro Tyr Leu Arg Ser Ala Asp Thr Thr His Ser Thr Val lieu 
35 40 45 

Gly Leu Tyr Asn Thr Leu Asn Pro Glu Ala Ser Ala Ser Pro Cys Cys 
50 55 60 

Val Pro Gin Asp Leu Glu Pro Leu Thr lie Leu Tyr Tyr Val Gly Arg 
65 70 75 80 

Thr Pro Lys Val Glu Gin Leu Ser Asn Met Val Val Lys Ser Cys Lys 
85 90 95 

Cys Ser 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 98 amino acids 
(B> TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1 . ,98 

(D) OTHER INFORMATION: /note= "TGF-B4 SEQUENCE" 



1VO96i/4077] 



PCT/US96Am93 



-73- 

(xi) SEQUENCE DBSCRIPTIC»I: SEQ ID NO: 4: 

Cys Cys Val Arg Pro Leu Tyr He Asp Ph Arg Lys Asp Leu Gin Trp 
15 10 15 

Lys Trp He His Glu Pro Lys Gly Tyr Met Ala Asn Phe Cys Met Gly 
20 25 30 

Pro Cys Pro Tyr He Trp Ser Ala Asp Thr Gin Tyr Thr Lys Val Leu 
35 40 45 

Ala Leu Tyr Asn Gin His Asn Pro Gly Ala Ser Ala Ala Pro Cys Cys 
50 55 60 

Val Pro Gin Thr Leu Asp Pro Leu Pro He He Tyr Tyr Val Gly Arg 
^5 70 75 80 

Asn Val Arg Val Glu Gin Leu Ser Asn Met Val Val Arg Ala Cys Lys 
85 90 95 

Cys Ser 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 98 amino acids 

(B) TYPE: amino acid 

(C) SniANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1 . . 98 

(D) OTHER INFORMATION: /note= -TGF-B5 SEQUENCE" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Cys Cys Val Lys Pro Leu Tyr He Asn Phe Arg Lys Asp Leu Gly Trp 
15 10 15 

Lys Trp He His Glu Pro Lys Gly Tyr Glu Ala Asn Tyr Cys Leu Gly 
20 25 30 

Asn Cys Pro Tyr He Trp Ser Met Asp Thr Gin Tyr Ser Lys Val Leu 
35 40 45 

Ser Leu Tyr Asn Gin Asn Asn Pro Gly Ala Ser He Ser Pro Cys Cys 
50 55 60 

Val Pro Asp Val Leu Glu Pro Leu Pro He He Tyr Tyr Val Gly Arg 
65 70 75 80 

Thr Ala Lys Val Glu Gin Leu Ser Asn Met Val Val Arg Ser Cys Asn 
85 90 95 

Cys Ser 
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(2) INFORMATION FOR SEQ ID N0:6: 

(i) SBOOENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(ix) FEATDRE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..102 

(D) OTHER INFORMATION: /note« "DPP SEQUENCE" 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:6: 

Cys Arg Arg His Ser Leu Tyx Val Asp Phe Ser Asp Val Gly Trp Asp 
15 10 15 

Asp Trp lie Val Ala Pro Leu Gly Tyr Asp Ala Tyr Tyr Cys His Gly 
20 25 30 

Lys Cys Pro Phe Pro Leu Ala Asp His Phe Asn Ser Thr Asn His Ala 
35 40 45 

Val Val Gin Thr Leu Val Asn Asn Met Asn Pro Gly Lys Val Pro Lys 
50 55 60 

Ala Cys Cys Val Pro Thr Gin Leu Asp Ser Val Ala Met Leu Tyr Leu 
65 70 75 80 

Asn Asp Gin Ser Thr Val Val Leu Lys Asn Tyr Gin Glu Met Thr Val 
85 90 95 

Val Gly Cys Gly Cys Arg 
100 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: I . . 102 

(D) OTHER INFORMATION: /note= "VGl SEQUENCE" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Cys Lys Lys Arg His Leu Tyr Val Glu Phe Lys Asp Val Gly Trp Gin 
15 10 15 
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Asn Trp Val He Ala Pro Gin Gly Tyr Met Ala Asn Tyr Cys Tyr Gly 
20 25 30 

Glu Cys Pro Tyr Pro Leu Thr Glu He Leu Asn Gly Ser Asn His Ala 
35 40 45 

He Leu Gin Thr Leu Val His Ser He Glu Pro Glu Asp He Pro Leu 
50 55 60 

Pro Cys cys Val Pro Thr Lys Met Ser Pro He Ser Met Leu Phe Tyr 
65 70 75 80 

Asp Asn Asn Asp Asn Val Val Leu Arg His Tyr Glu Asn Met Ala Val 
85 90 95 

Asp Glu Cys Gly Cys Arg 
100 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..102 

(D) OTHER INFORMATION: /note= "VGRl SEQUENCE" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Cys Lys Lys Arg His Leu Tyr Val Glu Phe Lys Asp Val Gly Trp Gin 
15 10 15 

Asn Trp Val He Ala Pro Gin Gly Tyr Met Ala Asn Tyr Cys Tyr Gly 
20 25 30 

Glu Cys Pro Tyr Pro Leu Thr Glu He Leu Asn Gly Ser Asn His Ala 
35 40 45 

He Leu Gin Thr Leu Val His Ser He Glu Pro Glu Asp He Pro Leu 

50 55 60 

Pro Cys Cys Val Pro Thr Lys Met Ser Pro He Ser Met Leu Phe Tyr 
65 70 75 80 

Asp Asn Asn Asp Asn Val Val Leu Arg His Tyr Glu Asn Met Ala Val 
85 90 95 

Asp Glu Cys Gly Cys Arg 

100 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..102 

(D) OTTHEk INFCMIMATION: /note= "SOA SEQDENCE" 
(xi) SEQDENCE DESCRIPTION: SEQ ID NO: 9: 

Cys Gin Met Gin Thr Leu Tyr lie Asp Phe Lys Asp Leu Gly Trp His 
15 10 15 

Asp Trp He He Ala Pro Glu Gly Tyr Gly Ala Phe Tyr Cys Ser Gly 
20 25 30 

Glu Cys Asn Phe Pro Leu Asn Ala His Met Asn Ala Thr Asn His Ala 
35 40 45 

He Val Gin Thr Leu Val His Leu Leu Glu Pro Lys Lys Val Pro Lys 
50 55 60 

Pro Cys Cys Ala Pro Thr Arg Leu Gly Ala Leu Pro Val Leu Tyr His 
^5 70 75 80 

Leu Asn Asp Glu Asn Val Asn Leu Lys Lys Tyr Arg Asn Met He Val 
85 90 95 

Lys Ser Cys Gly Cys His 
100 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 101 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1 . ,101 

(D) OTHER INFORMATION: /note= "BMP-2A SEQUENCE" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Cys Lys Arg His Pro Leu Tyr Val Asp Phe Ser Asp Val Gly Trp Asn 
15 10 15 

Asp Trp He Val Ala Pro Pro Gly Tyr His Ala Phe Tyr Cys His Gly 
20 25 30 

Glu Cys Pro Phe Pro Leu Ala Asp His Leu Asn Ser Thr Asn His Ala 
35 40 45 
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Ile Val Gin Thr Leu Val Asn Ser Val Asn Ser Lys He Pro Lys Ala 
50 55 60 

Cys Cys Val Pro Thr Glu lieu Ser Ala He Ser Met Leu Tyr Ijeu Asp 
65 70 75 80 

Glu Asn Glu Lys Val Val Leu Lys Asn Tyr Gin Asp Met Val Val Glu 
65 90 95 



Gly Cys Gly Cys Arg 
100 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 103 amino acids 

(B) TYPE: amino acid 

(C) STRANDBDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; protein 



(ix) FEATORE: 

(A) NAME/KEY: Protein 

(B) LOCATIC»I: 1..103 

(D) OTHER INFORMATION: /note» "BMP3 SEQUENCE" 



(xi) SEQUENCE DESCRIPTICW: SEQ ID NO: 11: 

Cys Ala Arg Arg Tyr Leu Lys Val Asp Phe Ala Asp lie Gly Trp Ser 
1 5 10 15 

Glu Trp lie lie Ser Pro Lys Ser Phe Asp Ala Tyr Tyr Cys Ser Gly 
20 25 30 

Ala Cys Gin Phe Pro Met Pro Lys Ser Leu Lys Pro Ser Asn His Ala 
35 40 45 

Thr He Gin Ser He Val Arg Ala Val Gly Val Val Pro Gly He Pro 
50 55 60 

Glu Pro Cys Cys Val Pro Glu Lys Met Ser Ser Leu Ser He Leu Phe 
65 70 75 80 

Phe Asp Glu Asn Lys Asn Val Val Leu Lys Val Tyr Pro Asn Met Thr 
85 90 95 

Val Glu Ser Cys Ala Cys Arg 
100 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: lOl amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 
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(ix) FEATDRB: 

(A) NAME/KEY: Protein 

(B) LOCATIOH: 1..X01 

<D) OTHER INFORMATION: /note= ■BMP-4 SEQUENCE" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Cys Arg Arg His Ser Leu Tyx Val Asp Phe Ser Asp Val Gly Trp Asn 
15 10 15 

Asp Tzp lie Val Ala Pro Pro Gly lyr Gin Ala Phe Tyr Cys His Gly 
20 25 30 

Asp cys Pro Phe Pro Leu Ala Asp His Leu Asn Ser Thr Asn His Ala 
35 40 45 

He Val Gin Thr Leu Val Asn Ser Val Asn Ser Ser He Pro Lys Ala 
50 55 60 

Cys Cys Val Pro Thr Glu Leu Ser Ala He Ser Met Leu Tyr Leu Asp 
65 70 75 80 

Glu oyr Asp Lys Val Val Leu Lys Asn Tyr Gin Glu Met Val Val Glu 
85 90 95 

Gly Cys Gly Cys Arg 
100 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..102 

(D) OTHER INFORMATION: /note= "BMP-S SEQUENCE" 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Cys Lys Lys His Glu Leu Tyr Val Ser Phe Arg Asp Leu Gly Trp Gin 
15 10 15 

Asp Trp He He Ala Pro Glu Gly Tyr Ala Ala Phe Tyr Cys Asp Gly 
20 25 30 

Glu Cys Ser Phe Pro Leu Asn Ala His Met Asn Ala Thr Asn His Ala 
35 40 45 

He Val Gin Thr Leu Val His Leu Met Phe Pro Asp His Val Pro Lys 
50 55 60 
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Pro Cys Cys Ala Pro Thr Lys lieu Asn Ala lie Ser Val Leu Tyr Phe 
65 70 75 80 

Asp Asp Ser Ser Asn Val He lieu Lys Lys Tyr Arg Asn Met Val Val 
85 90 95 

Arg Ser Cys Gly Cys His 
100 

(2) INFORMATICS? FOR SBQ ID NO: 14: 

(i) SECJOBNCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acid 

CO STRANDBDNBSS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..102 

(O) OTHER INFORMATION: /note» **BMP-6 SEQUENCE" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Cys Arg Lys His Glu Leu Tyr Val Ser Phe Gin Asp Leu Gly Trp Gin 
15 10 15 

Asp Trp He He Ala Pro Lys Gly Tyr Ala Ala Asn Tyr Cys Asp Gly 

20 25 30 

Glu Cys Ser Phe Pro Leu Asn Ala His Met Asn Ala Thr Asn His Ala 
35 40 45 

He Val Gin Thr Leu Val His Leu Met Asn Pro Glu Tyr Val Pro Lys 
50 55 60 

Pro Cys Cys Ala Pro Thr Lys Leu Asn Ala He Ser Val Leu Tyr Phe 
65 70 75 80 

Asp Asp Asn Ser Asn Val He Leu Lys Lys Tyr Arg Asn Met Val Val 
B5 90 95 

Arg Ala Cys Gly Cys His 

100 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 103 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(ix) 



FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..103 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Cys Arg Arg Thr Ser Leu His Val Asn Phe Lys Glu lie Gly Trp Asp 
15 10 15 

Ser Trp He He Ala Pro Lys Asp Tyr Glu Ala Phe Glu Cys Lys Gly 
20 25 30 

Gly Cys Phe Phe Pro Leu Thr Asp Asn Val Thr Pro Thr Lys His Ala 
35 40 45 

He Val Gin Thr Leu Val His Leu Gin Asn Pro Lys Lys Ala Ser Lys 
50 55 60 

Ala Cys Cys Val Pro Thr Lys Leu Asp Ala He Ser He Leu Tyr Lys 
65 70 75 BO 

Asp Asp Ala Gly Val Pro Thr Leu He Tyr Asn Tyr Glu Gly Met Lys 
85 90 95' 

Val Ala Glu Cys Gly Cys Arg 
100 

(2) INFORMATION FOR SEQ ID NO: 16: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(ix) FEATDRE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..102 

(D) OTHER INFORMATION: /note- "OP-1 SEQUENCE" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Cys Lys Lys His Glu Leu Tyr Val Ser Phe Arg Asp Leu Gly Trp Gin 
15 10 15 

Asp Trp He He Ala Pro Glu Gly Tyr Ala Ala Tyr Tyr Cys Glu Gly 
20 25 30 

Glu Cys Ala Phe Pro Leu Asn Ser Tyr Met Asn Ala Thr Asn His Ala 
35 40 45 

He Val Gin Thr Leu Val His Phe He Asn Pro Glu Thr Val Pro Lys 
50 55 60 

Pro Cys Cys Ala Pro Thr Gin Leu Asn Ala He Ser Val Leu Tyr Phe 

65 70 75 80 

Asp Asp Ser Ser Asn Val He Leu Lys Lys Tyr Arg Asn Met Val Val 
85 90 95 
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Arg Ala Cys Gly Cys His 
100 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) IiENGTH: 102 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATIC»f: 1..102 

(D) OTHER INFORMATIC»l: /note^ «0P-2 SEQUENCE" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Cys Arg Arg His Glu Leu Tyr Val Ser Phe Gin Asp Leu Gly Trp Leu 
15 10 15 

Asp Trp Val lie Ala Pro Gin Gly Tyr Ser Ala Tyr Tyr Cys Glu Gly 
20 25 30 

Glu Cys Ser Phe Pro Leu Asp Ser Cys Met Asn Ala l^r Asn His Ala 
35 40 45 

He Leu Gin Ser Leu Val His Leu Met Lys Pro Asn Ala Val Pro Lys 

50 55 60 

Ala Cys Cys Ala Pro Thr Lys Leu Ser Ala Thr Ser Val Leu Tyr Tyr 
65 70 75 80 

Asp Ser Ser Asn Asn Val He Leu Arg Lys His Arg Asn Net Val Val 
85 90 95 

Lys Ala Cys Gly Cys His 
100 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1. .102 

(D) OTHER INFORMATION: /note= "OP -3 SEQUENCE" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
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Cys Arg Arg His Glu Leu Tyr Val Ser Phe Arg Asp Leu Gly Trp L u 
15 10 15 

Asp Ser Val lie Ala Pro Gin Gly Tyr Ser Ala Tyr Tyr Cys Ala Gly 
20 25 30 

Glu Cys He Tyr Pro Leu Asn Ser cys Met Asn Ser Thr Asn His Ala 
35 40 45 

Thr Met Gin Ala Leu Val His Leu Met Lys Pro Asp He He Pro Lvs 
50 55 60 

Val Cys Cys Val Pro Thr Glu Leu Ser Ala He Ser Leu Leu Tyr Tyr 
^5 70 75 80 

Asp Arg Asn Asn Asn Val He Leu Arg Arg Glu Arg Asn Met Val Val 
85 90 95 

Gin Ala Cys Gly Cys His 
100 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNBSS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



<ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..107 

(D) OTHER INFORMATION: /note«= "GDF-1 SEQUENCE" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Cys Arg Thr Arg Arg Leu His Val Ser Phe Arg Glu Val Gly Trp His 
15 10 15 

Arg Trp Val He Ala Pro Arg Gly Phe Leu Ala Asn Phe Cys Gin Glv 
20 25 30 

Thr Cys Ala Leu Pro Glu Thr Leu Arg Gly Pro Gly Gly Pro Pro Ala 
35 40 45 

Leu Asn His Ala Val Leu Arg Ala Leu Met His Ala Ala Ala Pro Thr 
50 55 60 

Pro Gly Ala Gly Ser Pro Cys Cys Val Pro Glu Arg Leu Ser Pro He 
^5 70 75 80 

Ser Val Leu Phe Phe Asp Asn Ser Asp Asn Val Val Leu Arg His Tyr 
85 90 95 

Glu Asp Met Val Val Asp Glu Cys Gly Cys Arg 
100 105 



(2) INFORMATION FOR SEQ ID NO: 20: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENCnH: 101 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 



(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..101 

(D) OTHER INFORMATION: /note« "6DF-3 SEQUENCE* 



(xi) SEQUENCE DESCRIPTIC»^ : SEQ ID N0:20: 

Cys His Arg His Gin Leu Phe lie Asn Phe Gin Asp Leu Gly Trp His 
15 10 15 

Lys Tip Val lie Ala Pro Lys Gly Phe Met Ala Asn Tyr Cys His Gly 
20 25 30 

Glu Cys Pro Phe Ser Met Thr Thr Tyr Leu Asn Ser Ser Asn Tyr Ala 
35 40 45 

Phe Net Gin Ala Leu Met His Met Ala Asp Pro Lys Val Pro Lys Ala 
50 55 60 

Val Cys Val Pro Thr Lys Leu Ser Pro lie Ser Met Leu Tyr Gin Asp 
65 70 75 80 

Ser Asp Lys Asn Val lie Leu Arg His Tyr Glu Asp Met Val Val Asp 
85 90 95 

Glu Cys Gly Cys Gly 
100 



(2) INFORMATICS FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..102 

(D) OTHER INFORMATION: /noCe-= -GDF-9 SEQUENCE" 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:21: 

Cys Glu Leu His Asp Phe Arg Leu Ser Phe Ser Gin Leu Lys Trp Asp 
15 10 15 
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Asn Tip He Val Ala Pro His Arg Tyr Asn Pro Arg Tyx Cys Lys Gly 
20 25 30 

Asp Cys Pro Arg Ala Val Arg His Arg Tyr Gly Ser Pro Val His Thr 
35 40 45 

Met Val Gin Asn He He Tyr Glu Lys Leu Asp Pro Ser Val Pro Arg 
50 55 60 

Pro Ser Cys Val Pro Gly Lys Tyr Ser Pro Leu Ser Val Leu Thr He 
65 70 75 80 

Glu Pro Asp Gly Ser He Ala Tyr Lys Glu Tyr Glu Asp Met He Ala 
85 90 95 

Thr Arg Cys Thr Cys Arg 
100 

(2) INFORMATION FOR SEQ ID N0:22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 105 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLBCOLE TYPE: protein 

(ix) FEATURE: 

(A) NAME/KEY: Protein 
<B) LOCATION: 1..105 

(D) OTHER INFORMATION: /note- "INHIBIN-A SEQUENCE" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 

Cys His Arg Val Ala Leu Asn He Ser Phe Gin Glu Leu Gly Trp Glu 
IS 10 15 

Arg Trp He Val Tyr Pro Pro Ser Phe He Phe His Tyr Cys His Gly 
20 25 30 

Gly Cys Gly Leu His He Pro Pro Asn Leu Ser Leu Pro Val Pro Gly 
35 40 45 

Ala Pro Pro Thr Pro Ala Gin Pro Tyr Ser Leu Leu Pro Gly Ala Gin 
50 55 60 

Pro Cys Cys Ala Ala Leu Pro Gly Thr Met Arg Pro Leu His Val Arg 
65 70 75 80 

Thr Thr Ser Asp Gly Gly Tyr Ser Phe Lys Tyr Glu Thr Val Pro Asn 
85 90 95 

Leu Leu Thr Gin His Cys Ala Cys He 
100 105 



(2) INFORMATION FOR SEQ ID NO: 23: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 106 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(ix) FEATURE: 

(A) HAME/KEY: Protein 

(B) LOCATION: 1..106 

(D) OTHER INFORMATION: /note= "INHIBIN-BA SEQUENCE" 
(xi) SEQUENCE DESCRIPTION: 5EQ ID NO:23: 

Cys Cys Lys Lys Gin Phe Phe Val Ser Phe Lys Asp He Gly Trp Asn 
15 10 15 

Asp Trp He He Ala Pro Ser Gly Tyr His Ala Asn Tyr Cys Glu Gly 
20 25 30 

Glu Cys Pro Ser His He Ala Gly Thr Ser Gly Ser Ser Leu Ser Phe 
35 40 45 

His Ser Thr Val He Asn His Tyr Arg Met Arg Gly His Ser Pro Phe 
50 55 €0 

Ala Asn Leu Lys Ser Cys Cys Val Pro Thr Lys Leu Arg Pro Met Ser 
65 70 75 80 

Met Leu Tyr Tyr Asp Asp Gly Gin Asn He He Lys Lys Asp He Gin 
85 90 95 

Asn Met He Val Glu Glu Cys Gly Cys Ser 
100 105 

(2) INFORMATION FOR SEQ ID NO:24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 105 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: l.,105 

(D) OTHER INFORMATION: /note= "INHIBIN-BB SEQUENCE" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 

Cys Cys Arg Gin Gin Phe Phe He Asp Phe Arg Leu He Gly Trp Asn 
15 10 15 

Asp Trp He He Ala Pro Thr Gly Tyr Tyr Gly Asn Tyr Cys Glu Gly 
20 25 30 

Ser Cys Pro Ala Tyr Leu Ala Gly Val Pro Gly Ser Ala Ser Ser Phe 
35 40 45 
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His Thr Ala Val Val Asn Gin Tyr Arg Met Arg Gly Leu Asn Pro Gly 
50 55 60 

Thr Val Asn Ser Cys Cys He Pro Thr Lys Leu Ser Thr Met Ser Met 
65 70 75 80 

Leu Tyr Phe Asp Asp Glu Tyr Asn He Val Lys Arg Asp Val Pro Asn 
85 90 95 

Met He Val Glu Glu Cys Gly Cys Ala 
100 105 



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 98 amino acids 

(B) TYPE: omino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1 . . 98 

(D) OTHER INFORMATION: /note= "TGF-B SUBGROUP SEQUENCE 
PATTERN. Each Xaa is independently selected from 
a group of one or more specified amino acids as 
defined in the specification." 



(xi) SEQUENCE DESCRIPTIOT : SEQ ID NO: 25: 

Cys Cys Val Arg Pro Leu Tyr He Asp Phe Arg Xaa Asp Leu Gly Trp 
15 10 15 

Lys Trp He His Glu Pro Lys Gly Tyr Xaa Ala Asn Phe Cys Xaa Gly 
20 25 30 

Xaa Cys Pro Tyr Xaa Trp Ser Xaa Asp Thr Gin Xaa Ser Xaa Val Leu 
35 40 45 

Xaa Leu Tyr Asn Xaa Xaa Asn Pro Xaa Ala Ser Ala Xaa Pro Cys Cys 
50 55 60 

Val Pro Gin Xaa Leu Glu Pro Leu Xaa He Xaa Tyr Tyr Val Gly Arg 

65 70 75 80 

Xaa Xaa Lys Val Glu Gin Leu Ser Asn Met Xaa Val Xaa Ser Cys Lys 
85 90 95 



Cys Ser 



(2) INFORMATION FOR SEQ ID NO:26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TTPE: protein 



(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..104 

(D) OTHER INFORMATICMI: /note- "VG/OPP SUBGROUP SEQUENCE 
PATTERN. Each Xaa is independently selected from 
a group of "one or more specified amino acids as 
defined in the specification" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 

Cys Xaa Xaa Xaa Xaa Leu Tyr Val Xaa Phe Xaa Asp Xaa Gly Trp Xaa 
15 10 15 

Asp Trp He He Ala Pro Xaa Gly Tyr Xaa Ala Xaa Tyr Cys Xaa Gly 
20 25 30 

Xaa Cys Xaa Phe Pro Leu Xaa Xaa Xaa Xaa Asn Xaa Thr Asn His Ala 
35 40 45 

He Xaa Gin Thr Leu Val Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Pro 
50 55 60 

Lys Xaa Cys Cys Xaa Pro Thr Xaa Leu Xaa Ala Xaa Ser Xaa Leu Tyr 
65 70 75 eo 

Xaa Asp Xaa Xaa Xaa Xaa Xaa Val Xaa Leu Xaa Xaa Tyr Xaa Xaa Met 
65 90 95 

Xaa Val Xaa Xaa Cys Gly Cys Xaa 
100 



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..107 

(D) OTHER INFORMATION: /note= "GDF SUBGROUP PATTERN. 

Each Xaa is independently selected from a group 
of one or more specified amino acids as defined 
in the specification" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 



Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Phe Xaa Xaa Xaa Xaa Trp Xaa 
15 10 15 
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Xaa Trp Xaa Xaa Ala Pro Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys Xaa Gly 
20 25 30 

Xaa Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
35 40 45 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
50 55 60 

Pro Xaa Xaa Xaa Xaa Xaa Xaa Cys Val Pro Xaa Xaa Xaa Ser Pro Xaa 
65 70 75 80 

Ser Xaa Leu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Tyr 
85 90 95 

Glu Asp Met Xaa Xaa Xaa Xaa Cys Xaa Cys Xaa 
100 105 



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 109 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 



(ix) PEATDRE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..109 

(D) OTHER INFORMATION: /note- "INHIBIN SUBGROUP PATTERN. 

Each Xaa is independently selected from a group of 
one or more specified amino acids as defined in 
the specification" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Phe Xaa Xaa Xaa Gly Trp Xaa 
15 10 15 

Xaa Trp lie Xaa Xaa Pro Xaa Xaa Xaa Xaa Xaa Xaa Tyr Cys Xaa Gly 

20 25 30 

Xaa Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
35 40 45 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

50 55 60 

Xaa Xaa Xaa Xaa Xaa Cys Cys Xaa Xaa Xaa Pro Xaa Xaa Xaa Xaa Xaa 
65 70 75 80 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Asp Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
85 90 95 

Xaa Xaa Xaa Asn Xaa Xaa Xaa Xaa Xaa Cys Xaa Cys Xaa 
100 105 
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(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 348 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECDLE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 1..348 

(D) OTHER INFORMATION: /prcxiuct* "DNA ENCODING OP-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

ATGGCTTGTA AGAAGCACGA GCTGTATGTC AGCTTCCGAG ACCTGGGCTG GCAGGACTGG 60 

ATCATCGCGC CTGAAGGCTA CGCCGCCTAC TACTGTGGCG GCTCTGGAGG TGGATCCGGT 120 

GGAGGCTCCG GTGCCTTCCC TCTGAACTCC TACATGAACG CCACCAACCA CGCCATCGTG 180 

CAGACGCTGG TCCACTTCAT CAACCCGGAA ACGGT6CCCA AGCCCGCTAG CGGTGGTTGT 240 

GCGCCCACGC AGCTCAATGC CATCTCCGTC CTCTACTTCG ATGACAGCTC CAACGTCATC 300 

CTGAAGAAAT ACAGAAACAT GGTGGTCCGG GCATGCGGCT CCCACTAA 348 
(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 115 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1 . . 115 

(D) OTHER INFORMATION: /note= "OP-1 MORPHON PROTEIN" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

Met Ala Cys Lys Lys His Glu Leu Tyr Val Ser Phe Arg Asp Leu Gly 
15 10 15 

Trp Gin Asp Trp lie He Ala Pro Glu Gly Tyr Ala Ala Tyr Tyr Cys 
20 25 30 

Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser Gly Ala Phe Pro Leu 
35 40 45 

Asn Ser Tyr Met Asn Ala Thr Asn His Ala He Val Gin Thr Leu Val 
50 55 60 
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His Phe lie Asn Pro Glu Thr Val Pro Lys Pro Ala Ser Gly Gly Cys 
65 70 75 80 

Ala Pro Thr Gin Leu Asn Ala lie Ser Val Leu Tyr Phe Asp Asp Ser 
85 90 95 

Ser Asn Val lie Leu Lys Lys Tyr Arg Asn Met Val Val Acg Ala Cys 
100 105 110 

Gly Ser His 
115 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1878 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECDLE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: misc^f eature 

(B) LOCATION: 1..1878 

(D) OTHER INFORMATION: /product = "OP-1 cDNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 



GGGCGCAGCG 


GGGCCCGTCT 


GCAGCAAGTG 


ACCGACGGCC 


GGGACGGCCG 


CCTGCCCCCT 


60 


CTGCCACCTG 


GGGCGGTGCG 


GGCCCGGAGC 


CCGGAGCCCG 


GGTAGCGCGT 


AGAGCCGGCG 


120 


CGATGCACGT 


GCGCTCACTG 


CGAGCTGCGG 


CGCCGCACAG 


CTTCGTGGCG 


CTCTGGGCAC 


180 


CCCTGTTCCT 


GCTGCGCTCC 


GCCCTGGCCG 


ACTTCAGCCT 


GGACAACGAG 


GTGCACTCGA 


240 


GCTTCATCCA 


CCGGCGCCTC 


CGCAGCCAGG 


AGCGGCGGGA 


GATGCAGCGC 


GAGATCCTCT 


300 


CCATTTTGGG 


CTTGCCCCAC 


CGCCCGCGCC 


CGCACCTCCA 


GGGCAAGCAC 


AACTCGGCAC 


360 


CCATGTTCAT 


GCTGGACCTG 


TACAACGCCA 


TGGCGGTGGA 


GGAGGGCGGC 


GGGCCCGGCG 


420 


GCCAGGGCTT 


CTCCTACCCC 


TACAAGGCCG 


TCTTCAGTAC 


CCAGGGCCCC 


CCTCTGGCCA 


480 


GCCTGCAAGA 


TAGCCATTTC 


CTCACCGACG 


CCGACATGGT 


CATGAGCTTC 


GTCAACCTCG 


540 


TGGAACATGA 


CAAGGAATTC 


TTCCACCCAC 


GCTACCACCA 


TCGAGAGTTC 


CGGTTTGATC 


600 


TTTCCAAGAT 


CCCAGAAGGG 


GAAGCTGTCA 


CGGCAGCCGA 


ATTCCGGATC 


TACAAGGACT 


660 


ACATCCGGGA 


ACGCTTCGAC 


AATGAGACGT 


TCCGGATCAG 


CGTTTATCAG 


GTGCTCCAGG 


720 


AGCACTTGGG 


CAGGGAATCG 


GATCTCTTCC 


TGCTCGACAG 


CCGTACCCTC 


TGGGCCTCGG 


780 


AGGAGGGCTG 


GCTGGTGTTT 


GACATCACAG 


CCACCAGCAA 


CCACTGGGTG 


GTCAATCCGC 


840 


GGCACAACCT 


GGGCCTGCAG 


CTCTCGGTGG 


AGACGCTGGA 


TGGGCAGAGC 


ATCAACCCCA 


900 


AGTTGGCGGG 


CCTGATTGGG 


CGGCACGGGC 


CCCAGAACAA 


GCAGCCCTTC 


ATGGTGGCTT 


960 
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TCTTCAAGGC CACGGAGGTC CACTTCCGCA GCATCCGGTC CACGGGGAGC AAACAGCGCA 1020 

GCCAGAACCG CTCCAAGACG CCCAAGAACC AGGAAGCCCT GCGGATGGCC AACGTGGCAG 1080 

AGAACAGCAG CAGCGACCAG AGGCAGGCCT 6TAAGAAGCA CGAGCTGTAT GTCAGCTTCC 1140 

GAGACCTGGG CTGGCAGGAC TGGATCATCG CGCCTGAAGG CTACGCCGCC TACTACTGTG 1200 

AC3GGGGAGTG TGCCTTCCCT CTGAACTCCT ACATGAACX3C CACCAACCAC GCCATCGTGC 1260 

AGACGCTGGT CCACTTCATC AACCCGGAAA CGGTGCCCAA GCCCTGCTGT GCGCCCACGC 1320 

AGCTCAATGC CATCTCCGTC CTCTACTTCG ATGACAGCTC CAACGTCATC CTGAAGAAAT 1380 

ACAGAAACAT GGTGGTCCGG GCCTGTGGCT GCXACTAGCT CCTCCGAGAA TTCAGACCCT 144 0 

TTGGGGCCAA GTTTTTCTGG ATCCTCCATT GCTCGCCTTG GCCAGGAACC AGCAGACCAA 1500 

CTGCCTTTTG TGAGACCTTC CCCTCCCTAT CCCCAACTTT AAAGGTGTGA GAGTATTAGG 1560 

AAACATGAGC AGCATATGGC TTTTGATCAG TTTTTCAGTG GCAGCATCCA ATGAACAAGA 1620 

TCCTACAAGC TGTGCAGGCA AAACCTAGCA GGAAAAAAAA ACAACGCATA AAGAAAAATG 1680 

GCCGGGCCAG GTCATTGGCT GGGAAGTCTC AGCCATGCAC GGACTCGTTT CCAGAGGTAA 1740 

TTATGAGCGC CTACCAGCCA GGCCACCCAG CCGTGGGAGG AAGGGGGCGT GGCAAGGGGT 1800 

GGGCACATTG GTGTCTGTGC GAAAGGAAAA TTGACCCGGA AGTTCCTGTA ATAAATGTCA 1860 

CAATAAAACG AATGAATG 1878 
(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..6 0 

(D) OTHER INFORMATION: /product= "N TERMINAL LINKER" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 
GCCAACGTGG CAGAGAACAG GGCCGAGCGG GCCCATATGG CTTGTAAGAA GCACGAGCTG 60 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: CDNA 



(ix) FEATURE: 

(A) NAME/KEY: misc^f eature 

(B) LOCATIC»«: 1..48 

(D) OTHER INFORMATICMJ: /product « "SMALL LOOP LINKER" 



(xi) SEC30ENCE DESCRIPTION: SEQ ID NO: 33: 
GAAACGGTGC CCAAGCCCGC TAGCGGTGGT TGTGCGCCCA CGCAGCTC 
(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1. .66 

(D) OTHER INFORMATION: /product^ "LONG LOOP LINKER" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
GCCGCCTACT ACTGTGGCGG CTCTGGAGGT GGATCCGGTG GAGGCTCCGG TGCCTTCCCT 
CTGAAC 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: misc^feature 

(B) LOCATION: 1 , . 52 

(D) OTHER INFORMATION: /product= "C-TERMINAL LINKER 1" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
CATGGTGGTC CGGGCATGCG GCTCCCACGG TGCGGCCGCA GAATTCAGAC CC 
(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(li) MOLECDLE TYPE: cDNA 

(ix) FEATURE: 

(A) HAME/KEY: misc_f eature 

(B) LOCATION: 1..42 

(D) OTHER INFORMATICS: /product- "C-TERMINAL LINKER 2" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
CGGGCATGCG GCTCCCACTA AAGATCTGCA GAATTCAGAC CC 
(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1392 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: misc^f eature 

(B) LOCATION: 1..1392 

(D) OTHER INFORMATION: /product* "MODIFIED OP-1 DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 



CCCGGGTAGC 


GCGTAGAGCC 


GGCGCGATGC 


ACGTGCGCTC 


ACTGCGAGCT 


GCGGCGCCGC 


60 


ACAGCTTCGT 


GGCGCTCTGG 


GCACCCCTGT 


TCCTGCTGCG 


CTCCGCCCTG 


GCCGACTTCA 


120 


GCCTGGACAA 


CGAGGTGCAC 


TCGAGCTTCA 


TCCACCGGCG 


CCTCCGCAGC 


CAGGAGCGGC 


180 


GGGAGATGCA 


GCGCGAGATC 


CTCTCCATTT 


TGGGCTTGCC 


CCACCGCCCG 


CGCCCGCACC 


240 


TCCAGGGCAA 


GCACAACTCG 


GCACCCATGT 


TCATGCTGGA 


CCTGTACAAC 


GCCATGGCGG 


300 


TGC^GAGGG 


CGGCGGGCCC 


GGCGGCCAGG 


GCTTCTCCTA 


CCCCTACAAG 


GCCGTCTTCA 


360 


GTACCCAGGG 


CCCCCCTCTG 


GCCAGCCTGC 


AAGATAGCCA 


TTTCCTCACC 


GACGCCGACA 


420 


TGGTCATGAG 


CTTCGTCAAC 


CTCGTGGAAC 


ATGACAAGGA 


ATTCTTCCAC 


CCACGCTACC 


4B0 


ACCATCGAGA 


GTTCCGGTTT 


GATCTTTCCA 


AGATCCCAGA 


AGGGGAAGCT 


GTCACGGCAG 


540 


CCGAATTCCG 


GATCTACAAG 


GACTACATCC 


GGGAACGCTT 


CGACAATGAG 


ACGTTCCGGA 


600 


TCAGCGTTTA 


TCAGGTGCTC 


CAGGAGCACT 


TGGGCAGGGA 


ATCGGATCTC 


TTCCTGCTCG 


660 


ACAGCCGTAC 


CCTCTGGGCC 


TCGGAGGAGG 


GCTGGCTGGT 


GTTTGACATC 


ACAGCCACCA 


720 


GCAACCACTG 


GGTGGTCAAT 


CCGCGGCACA 


ACCTGGGCCT 


GCAGCTCTCG 


GTGGAGACGC 


780 


TGGATGGGCA 


GAGCATCAAC 


CCCAAGTTGG 


CGGGCCTGAT 


TGGGCGGCAC 


GGGCCCCAGA 


840 


ACAAGCAGCC 


CTTCATGGTG 


GCTTTCTTCA 


AGGCCACGGA 


GGTCCACTTC 


CGCAGCATCC 


900 
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GGTCCACXX3G GAGCAAACAG CGCAGCCAGA ACCGCTCCAA GACGCCCAAG AACCAGGAAG 960 

CCCTGCGGAT GGCCAACAGC CCGGCCGAGC GGGCCCATAT GGCrTGTAAG AAGCACGAGC 1020 

TGTATGTCAG CTTCCGAGAC CTGGGCTGGC AGGACTGGAT CATCGCGCCT GAAGGCTACG 1080 

CCXK:CTACTA CTGTGGCGGC TCTGGAGGTG GATCCGGTGG AGGCTCCGGT GCCTTCCCTC 1140 

TGAACTCCTA CATGAACGCC ACCAACCACG CCATCGTGCA GACGCTGGTC CACTTCATCA 1200 

ACCCGGAAAC GGTGCCCAAG CCCGCTAGCG GTGGTTGTGC GCCCACGCAG CTCAATGCCA 1260 

TCTCCGTCCT CTACTTCGAT GACAGCTCCA ACGTCATCCT GAAGAAATAC AGAAACATGG 1320 

TGCrrCCGGGC ATGCGGCTCC CACGGTGCGG CCGCAGAATT CAGACCCTTT GGGGCCAAGT 1380 

TTTTCTGGAT CC 1392 
(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDBDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..48 

(D) OTHER INFORMATION: /product- "SMALL LOOP MUTAGENESIS 
SEQUENCE A- 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
GAAACGGTGC CCAAGNNCNN TNNCNNTNNT NNTNNGCCCA CGCAGCTC 



(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDBDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 1..48 

(D) OTHER INFORMATION: /product* "SMALL LOOP MUTAGENESIS 
SEQUENCE B" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 



GAAACGGTGC CCAAGCCCNN NNNNNNNNNN TGTGCGCCCA CGCAGCTC 
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(2) INPCmiATIW FOR SEQ ID N0:40: 

(i) SEQUENCE CHARACTERISTICS: 
(A) XjENGTH: 66 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEONESS: Single 

(D) TOPOLOGY: linear 

(ii) MOLBCOLE TYPE: CDNA 



(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 1_66 

(D) OTHER INFORMATION: /product^ "LARGE LOOP MUTAGENESIS 
SEQUENCE A" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 
GCCGCCTACT ACTGTNNCNN CNNTNNANNT NNANNCGGTG GAGGCTCCGG TGCCTTCCCT 
CTGAAC 



60 
66 



(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SE^yJENCE CHARACTERISTICS: 

(A) LENGTH: 66 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: misc^feature 

(B) LOCATION: 1. .66 

(D) OTHER INFORMATION: /product= "LARGE LOOP MUTAGENESIS 
SEQUENCE B" 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:41: 
GCCGCCTACT ACTGTNNNNN NNNNNNNNNN NNNTCCGGTG GAGGCTCCGG TGCCTTCCCT 60 
CTGAAC 66 
(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 1. .66 

(D) OTHER INFORMATION: /product= "LARGE LOOP MUTAGENESIS 
SEQUENCE C" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 



GCCGCCTACT ACTGTGGCGG CTCTGGAGGT NNANNCNNTN NANNCNNCNN TNNCTTCCCT 



60 



CTGAAC 



66 



(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECDLE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 1..66 

(D) OTHER INFORMATION: /product-^ "LARGE LOOP NDTAGENESIS 



SEQUENCE D" 



(Xi) 



SEQUENCE DESCRIPTION: SEQ ID NO: 43: 



GCCGCCTACT ACTGTGGCGG CTCTGGAGGT GGANNNNNNN NNNNNNNNNN NNNNTTCCCT 



60 



CT6AAC 



66 
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What is claimed is: 

1 I . A dngle-chain synthetic amino add biopolym^ constnict comprising an amino add 

2 sequence 

3 defining finger 1, finger 2, and heel regions together complementary to the ligand 

4 binding interactive surface of a TGF-P superfamily member receptor structure and 

5 having an amino acid sequence which mimics the structure of a member of the TGF- 

6 P superfamily such that said construct preferentially binds said receptor. 

1 2. The construct of daim I wherein, upon binding with said receptor structure, said 

2 construct mimics the bioactivity of a TGF-p superfamily member. 

1 3 . The construct of claim 2 wherein, upon binding with said receptor structure, said 

2 construct initiates a morphogenic cascade. 

1 4. The construct of claim 1 wh»^: 

2 the amino acid sequence of the finger 1 region is folded to define a 

3 secondary structure comprising, in order fi'om the N to C direction, a first beta sheet, a helical 

4 turn, a loop, and a second beta sheet disposed substantially anti-parallel to said first beta sheet; 

5 the amino add sequence of the heel region is folded to define a secondary 

6 structure comprising an alpha helical coil; and 

7 the amino acid sequence of the finger 2 region is folded to define a 

8 secondary structure comprising, in order fi'om the N to C direction, a beta sheet, a loop, and a 

9 second beta sheet disposed substantially anti-parallel to said first beta sheet. 

1 5. The construct of claim 1 or 4, wherein said finger 1 and finger 2 regions comprise anti- 

2 parallel ^ pleated sheets which extend along adjacent, substantially parallel axes, and said heel 

3 region comprises a helical coil disposed about an axes transverse to said parallel axes. 

1 6 The construct of claim 4, wherein the finger 1 sequence is selected fi*om the group of 

2 finger 1 sequences consisting of 



3 


TGF-pi 


SEQ. 


ID. 


No. 


1, residues 2 through 29, 


4 


TGF-P2 


SEQ. 


ED. 


No. 


2, residues 2 through 29, 


5 


TGF-P3 


SEQ. 


ID. 


No. 


3, residues 2 through 29, 


6 


TGF-P4 


SEQ. 


ID. 


No. 


4, residues 2 through 29, 


7 


TGF-35 


SEQ. 


ID. 


No. 


S, residues 2 through 29, 


8 


dpp 


SEQ. 


ID. 


No. 


6, residues 2 through 29, 


9 


Vg-1 


SEQ. 


ED. 


No. 


7, residues 2 through 29, 


10 


Vgr-1 


SEQ. 


ID. 


No. 


8, residues 2 through 29, 
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11 60A SEQ. m.No. 9, residues 2 through 29, 

12 BMP-2A SEQ. ID. No. 10, residues 2 through 29, 

13 BMP3 SEQ. ID. No. 1 1, residues 2 through 29, 

1 4 BMP-4 SEQ. ID. No. 1 2, residues 2 through 29, 

1 5 BMP-5 SEQ. ID. No. 1 3, residues 2 through 29, 

16 BMP-6 SEQ. ID. No. 14, residues 2 through 29, 

1 7 Dorsalin SEQ. ID. No. 1 5, residues 2 through 29, 

18 OP-1 SEQ. ID. No. 1 6, residues 2 through 29, 

19 OP-2 SEQ. ID. No. 17, residues 2 through 29, 

20 OP-3 SEQ. ID. No. 18, residues 2 through 29, 

21 GDF-1 SEQ. ID. No. 19, residues 2 through 29, 

22 GDF-3 SEQ. ID. No. 20, residues 2 through 29, 

23 GDF-9 SEQ. ID. No. 21, residues 2 through 29, 

24 Inhibin a SEQ. ID. No. 22, residues 2 through 29, 

25 Inhibm PA SEQ. ID. No. 23, readues 2 through 29, and 

26 Inhibin PB SEQ. ID. No. 24, residues 2 through 29. 

1 7. The construct of claim 4 or 6, wherein the hed sequence is seleaed from the group of 

2 heel sequences consisting of 

3 TGF-pl SEQ. ID. No. 1, residues 35 through 62, 

4 TGF-P2 SEQ. ID. No. 2, residues 35 through 62, 

5 TGF-P3 SEQ. ID. No. 3. residues 35 through 62, 

6 TGF-P4 SEQ. ID. No. 4, residues 35 through 62. 

7 TGF-P5 SEQ. ID. No. 5, residues 35 through 62. 

8 dpp SEQ. ID. No. 6, residues 35 through 65, 

9 Vg-1 SEQ. ID. No. 7, residues 35 through 65, 

10 Vgr-1 SEQ. ID. No. 8, residues 35 through 65. 

11 60A SEQ. ID. No. 9. residues 35 through 65. 

12 BMP-2A SEQ. ID. No. 10, residues 35 through 64, 

13 BMP3 SEQ. ID. No. 1 1, residues 35 through 66, 

14 BMP-4 SEQ. ID. No. 12. residues 35 through 64. 

15 BMP-5 SEQ. ID. No. 13. residues 35 through 65. 

16 BMP-6 SEQ. ID. No. 14, residues 35 through 65. 

1 7 Dorsalin SEQ. ID. No. 1 5, residues 35 through 65, 

18 OP-1 SEQ. ID. No. 1 6, residues 35 through 65, 

1 9 OP-2 SEQ. ID. No. 1 7, residues 35 through 65, 

20 OP-3 SEQ. ID. No 1 8, residues 35 through 65, 

21 GDF-1 SEQ. ID. No. 1 9, residues 35 through 70, 
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22 GDF-3 SEQ. ID. No. 20, residues 3S through 64. 

23 GDF-9 SEQ. ID. No. 21, residues 35 through 65, 

24 Inhibina SEQ. ID. No. 22, residues 35 through 65, 

25 InhibinPA SEQ. ID. No. 23. residues 35 through 69, and 

26 Inhibin SEQ. ID. No. 24. residues 35 through 68. 

1 8. The construct of claim 4 or 6. wherein the finger 2 sequence is seleaed from the 

2 group of finger 2 sequences consisting of 

3 TGF-pl SEQ. ID. No. 1. residues 65 through 94, 

4 TGF-P2 SEQ. ID. No. 2, residues 65 through 94, 

5 TGF-P3 SEQ. ID. No. 3, residues 65 through 94, 

6 TGF-P4 SEQ. ID. No. 4, residues 65 through 94, 

7 TGF-P5 SEQ. ID. No. 5, residues 65 through 94. 

8 dpp SEQ. ID. No. 6, residues 68 through 98. 

9 Vg-1 SEQ. ID. No. 7, residues 68 through 98, 

10 Vgr-1 SEQ. ID. No. 8, residues 68 through 98, 

11 60A SEQ. ID. No. 9, residues 68 through 98, 

12 BMP-2A SEQ. ID. No. 1 0, residues 67 through 97, 

13 BMP3 SEQ. ID. No. 1 1, residues 69 through 99, 

14 BMP-4 SEQ. ID. No. 12, readues 67 through 97, 

15 BMP-5 SEQ. ID. No. 13, residues 68 through 98, 

16 BMP-6 SEQ. ID. No. 14, residues 68 through 98, 

17 Dorsaiin SEQ. ID. No. 15, residues 68 through 99, 

18 OP- 1 SEQ. ID. No. 1 6, residues 68 through 98, 

1 9 OP-2 SEQ. ID. No. 1 7, residues 68 through 98, 

20 OP-3 SEQ. ID. No. 1 8, residues 68 through 98, 

21 GDF-1 SEQ. ID. No. 19, residues 73 through 103, 

22 GDF-3 SEQ. ID. No. 20, residues 67 through 97, 

23 GDF-9 SEQ. ID. No. 2 1 , residues 68 through 98, 

24 Inhibin a SEQ. ID. No. 22, residues 68 through 101, 

25 InhibinPA SEQ. ID. No. 23, residues 72 through 102, and 

26 Inhibin PB SEQ. ID. No. 24. residues 71 through 101. 

1 9. The construct of claim 7, wherein the finger 2 sequence is selected from the group of 

2 finger 2 sequences consisting of 

3 TGF-pi SEQ. ID. No. 1, residues 65 through 94, 

4 TGF-P2 SEQ. ID. No. 2, residues 65 through 94. 

5 TGF-P3 SEQ. ID. No. 3, residues 65 through 94, 
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6 TGF-P4 SEQ. ID. No. 4, residues 65 through 94, 

7 TGF-pS SEQ. ID. No. 5, residues 65 through 94, 

8 dpp SEQ. ID. No. 6, residues 68 through 98, 

9 Vg-1 SEQ. ID. No. 7. rendues 68 through 98, 

10 Vgr-1 SEQ. ID. No. 8, residues 68 through 98, 

1 1 60A SEQ. ID. No. 9, residues 68 through 98, 

12 BMP-2A SEQ. ID. No. 10. residues 67 through 97, 

13 BMP3 SEQ. ID. No. 11 , residues 69 through 99, 

14 BMP-4 SEQ. ID. No. 12, residues 67 through 97, 

15 BMP-5 SEQ. ID. No. 13, residues 68 through 98, 

16 BMP-6 SEQ. ID. No. 14, residues 68 through 98, 

17 Dorsalin SEQ. ID. No. 15, readues 68 through 99, 

18 OP-1 SEQ. ID. No. 16, residues 68 through 98, 

19 OP-2 SEQ. ID. No. 17, residues 68 through 98, 

20 OP-3 SEQ. ID. No. 1 8, residues 68 through 98, 

21 GDF-1 SEQ. ID. No. 19, residues 73 through 103. 

22 GDF-3 SEQ. ID. No. 20. residues 67 through 97 

23 GDF-9 SEQ. ID. No. 21, residues 68 through 98, 

24 Inhibina SEQ. ID. No. 22. residues 68 through 101, 

25 Inhibin PA SEQ. ID. No. 23, residues 72 through 102, and 

26 Inhibin 3B SEQ. ID. No. 24, residues 71 through 101. 

1 10. The constiuct of claim 4, wherein the finger 1 sequence is selected fi-om the group of 

2 sequences consisting of 

3 dpp SEQ. ID. No. 6, residues 2 through 29, 

4 Vg-1 SEQ. ID. No. 7, residues 2 through 29, 

5 Vgr-1 SEQ. ID. No. 8, residues 2 through 29, 

6 60A SEQ. ID. No. 9, residues 2 through 29. 

7 BMP-2A SEQ. ID. No. 10. residues 2 through 29, 

8 BMP3 SEQ. ID. No. 1 1 , residues 2 through 29, 

9 BMP-4 SEQ. ID. No. 12, residues 2 through 29, 

10 BMP-5 SEQ. ID. No. 1 3, residues 2 through 29, 

1 1 BMP-6 SEQ. ID. No. 1 4, residues 2 through 29, 

1 2 Dorsalin SEQ. ID. No. 1 5 , residues 2 through 29, 

13 OP-1 SEQ. ID. No. 1 6, residues 2 through 29, 

14 OP-2 SEQ. ID. No. 1 7, residues 2 through 29, and 

1 5 OP-3 SEQ. ID. No. 1 8, residues 2 through 29. 
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1 11. The construct of claim 4 or 10, wherein the hed sequence is sdected from the group 

2 of sequences consisting of 

3 dpp SEQ. ID. No. 6, residues 35 through 65, 

4 Vg-1 SEQ. ID. No. 7, residues 35 through 65, 

5 Vgr-1 SEQ. ID. No. 8, residues 35 through 65, 

6 60A SEQ. ID. No. 9, residues 35 through 65, 

7 BMP-2A SEQ. ID. No. 1 0, residues 35 through 64, 

8 BMP3 SEQ. ID. No. 1 1, residues 35 through 66, 

9 BMP-4 SEQ. ID. No. 12, residues 35 through 64, 

10 BMP-5 SEQ. ID. No. 13, residues 35 through 65, 

1 1 BNfP-6 SEQ. ID. No. 14, reddues 35 through 65, 

12 Dorsaiin SEQ. ID. No. 15, residues 35 through 65, 

13 OP-I SEQ. ID. No. 16, residues 35 through 65, 

14 OP-2 SEQ. ID. No. 17, residues 35 through 65, and 

15 OP-3 SEQ. ID. No, 18, residues 35 through 65. 

1 12. The construct of claim 4 or 1 0, wherein the finger 2 sequence is sdected fi'om the 

2 group of sequences consisting of 

3 dpp SEQ. ID. No. 6, residues 68 through 98, 

4 Vg-1 SEQ. ID. No. 7, residues 68 through 98, 

5 Vgr-l SEQ. ID. No. 8, residues 68 through 98, 

6 . 60A SEQ. ID. No. 9, residues 68 through 98, 

7 BMP-2A SEQ. ID. No. 10, residues 67 through 97, 

8 BMP3 SEQ. ID. No. 1 1, residues 69 through 99, 

9 BMP-4 SEQ. ID. No. 12, residues 67 through 97, 

10 BMP-5 SEQ. ID. No. 13, residues 68 through 98, 

11 BMP-6 SEQ. ID. No. 14, residues 68 through 98, 

12 Dorsaiin SEQ. ID. No. 15, residues 68 through 99, 

13 OP-1 SEQ. ID. No. 16, residues 68 through 98, 

14 OP-2 SEQ. ID. No. 17, residues 68 through 98, and 

15 OP-3 SEQ. ID. No. 18. residues 68 through 98. 

1 13. The construct of claim 1 1, wherein the finger 2 sequence is selected from the group of 

2 sequences consisting of 

3 dpp SEQ. ID. No. 6, residues 68 through 98, 

4 Vg-1 SEQ. ID. No. 7. residues 68 through 98, 

5 Vgr-I SEQ. ID. No. 8, residues 68 through 98, 
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6 60A SEQ. ID. No. 9, residues 68 through 98, 

7 BMP-2A SEQ. ID. No. 1 0, residues 67 through 97, 

8 BMP-3 SEQ. ID. No. 1 1, readues 69 through 99, 

9 BMP-4 SEQ. ID. No. 12, residues 67 through 97, 

10 BMP-S SEQ. ID. No. 13, residues 68 thnnigb 98, 

1 1 BNfP-6 SEQ. ID. No. 14, residues 68 through 98, 

12 Dorsalin SEQ. ID. No. 1 5, residues 68 through 99, 

13 OP-1 SEQ. ID. No. 1 6, residues 68 through 98, 

14 OP-2 SEQ. ID. No. 1 7, residues 68 through 98, and 

15 OP-3 SEQ. ID. No. 18, residues 68 through 98. 

1 14. The construct of claim 4, wherdn the finger 1 sequence is OP-I finger 1 sequence 

2 [SEQ. ID. No. 16, residues 2 through 29]. 

1 IS. The constnict of claim 4 or 1 4, wherein the heel sequence is OP- 1 heel sequence 

2 [SEQ. ID. No. 16, residues 35 through 65]. 

1 16. The construct of claim 4 or 14, wherein the finger 2 sequence is OP-1 finger 2 

2 sequence [SEQ. ID. No. 16, residues 68 through 98]. 

1 17. The construct of claim 1 5, wherein the finger 2 sequence is OP-1 finger 2 sequence 

2 [SEQ. ID. No. 16, residues 68 through 98]. 

1 . 18. The construct of claim 4, wherein: 

2 the finger 1 sequence is OP- 1 finger 1 sequence [SEQ. ID. No. 16, residues 

3 2 through 29]; 

4 the heel sequence is OP-1 heel sequence [SEQ. ID. No. 16, residues 35 

5 through 65]; and 

6 the finger 2 sequence is OP- 1 finger 2 sequence [SEQ. ID. No. 16, residues 

7 68 through 98]. 

1 19. The construct of claim 1 wherein: 

2 the finger 1 region comprises a sequence selected from the group consisting of 

3 OP-1 finger 1 large peptide [SEQ. ID. No, 16, residues 5 through 28], 

4 OP-1 finger 1 medium peptide [SEQ. ED. No. 16, residues 7 through 26], and 

5 OP-1 finger 1 small peptide [SEQ. ID. No. 16, residues 1 1 through 22]; 

6 the heel region comprises a sequence selected from the group consisting of 

7 OP-1 heel large peptide [SEQ. ID. No. 16, residues 49 through 64], 

8 OP-1 heel medium peptide [SEQ. ID. No. 16, residues 52 through 64], and 
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9 OP-1 heel small pq)tide [SEQ. ED. No. 16, residues 56 through 62]; and 

1 0 the finger 2 region comprises a sequence seleaed fi-om the group consisting of 

1 1 OP-1 finger 2 large peptide [SEQ. ID. No. 16, residues 75 through 91], 

12 OP-1 finger 2 medium pq)tide [SEQ. ID. No. 16, residues 77 through 89], and 

13 OP-I finger 2 small pqmde [SEQ. ID. No. 1 6, residues 79 through 87]. 

1 20. The construct of claim 1 or 4, wherein said finger I region, said heel region, and said 

2 finger 2 region are joined by a pair of polypq>tide linkers to produce said single-chain amino acid 

3 construct. 

1 21 . The construct of daim 20, wherdn each of said link^ comprises between 3 and 13 

2 amino adds. 

1 22. The construa of daim 1 , wherein said regions are defined by amino add sequences 

2 selected firom the group consisting of 

3 (a) Fl-linker.F2-linker.H; 

4 (b) Fl -linker-H-linker-F2; 

5 (c) F2-linker-Fl-linker-H; 

6 (d) F2-linkcr-H-linker-Fl; 

7 (e) H-linker-Fl -linker-F2; and 

8 (f) H.linker.F2-linker-Fl, 

9 wherein H comprises an alpha helical region comprising said heel region, Fl comprises a 

10 pair of anti-parallel P strand structures defining said finger 1 region, F2 comprises a pair of anti- 

1 1 parallel P strands structures defining said finger 2 region, each finger and heel region begins and 

12 ends with a cysteine residue, each linker comprises an amino acid sequence having 3-13 amino 

13 adds, and wherein group a) and b) optionally fiirther comprise an N-terminal sequence. 

1 23. The construct of claim 22, wherein Fl comprises, in order from the N to C direction, a 

2 first P strand, a helical loop, a loop, and a second P strand antiparallel to the first P strand. 

1 24. The construct of claim 22, wherein F2 comprises, in order fi-om the N to C direction, a 

2 first P strand, a loop, and a second P strand anti-parallel to the first P strand. 

1 25. The DNA encoding the amino acid sequence of the construct of claim 1, 4, or 22. 



26. A cell line expressing a DNA of claim 25. 
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FIG. 2 A 




FIG. 2B 
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FIG. 7 A 



Finger 1 Fragment: Large Peptide 
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FIG. 7B 

Finger 1 Fragment: Medium Peptide 
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FIG. 8B 



Heel Fragment: Medium Peptide 
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Heel Fragment: Small Peptide 
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FIG. 9 A 



Finger 2 Fragment: Large Peptide 
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Finger 2 Fragment: Medium Peptide 
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FIG. 9C 



Finger 2 Fragment: Small Peptide 
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